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This application is a Continuation-in-Part Application of U.S. Pat. Appl. Ser. 
No. 08/951,621, filed October 16, 1997, pending, which is hereby incorporated herein 
by reference in its entirety. This invention was made with government support under 
National Institutes of Health Grant No. R01AG1 1085. The Government has certain 
rights in the invention. 

FIELD OF THE INVENTION 

The present invention provides compositions and methods for identification of 
F-box proteins, as well as for drug discovery and assessment. In particular, the present 
invention provides components of an E3 complex involved in ubiquiiination of cell 
cycle regulators and other proteins., as well as members of a class of proteins that 
directly function in recognition of ubiquitination targets. 

BACKGROUND OF THE INVENTION 

The proper development and maintenance of a multicellular organism is a ' 
complex process that, requires precise spatial and temporal control of cell proliferation. ■ 
Cell proliferation is controlled via an intricate network of extracellular and intracellular 
signaling pathways that process growth regulatory signals. This signaling network is 
superimposed upon the basic cell cycle regulatory machinery that controls particular, 
cell cycle transitions. In eukaryotes. the cell cycle is comprised of an ordered series of 
discrete. events., .In contrast to the periodicity of eukaryotic DNA replication and 
mitosis, cellular growth requires that most metabolic reactions occur continuously. 
The cell cycle regulatory machinery c6ordinates the events that occur during the cell 
cycle, as well as cell growth. Protein degradation is an important aspect of the 
development, and maintenance of multicellular organisms, as it provides direction, 
order, and the appropriate timing for the key events that occur during the cell cycle. 

The problem of how cell division is controlled has long been a topic of intense 
researc-h^-Early-models-suggested-theexistence-of-an-initiator-that-wouId-accumulate- 
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during the cell cycle, and induce DNA replication or mitosis when it reached a critical 
concentration. The mitotic process would then inactivate the initiator, thereby 
"resetting" the cell cycle. Subsequent research showed that mitotic cyclins accumulate 
during interphase to drive entry of cells into mitosis. These cyclins are then degraded 
at the end of mitosis, in order to reset the cycle. Protein degradation has been shown 
to have a pervasive role in the regulation of cell cycle progression. For example, 
proteolysis is required for multiple mitotic processes, and for initiating DNA 
replication (See.- King et al. f Science 274:1652-1659 [1996]). Nonetheless, much 
remains unknown regarding the proteins and the interactions that are involved in the 
proteolytic regulation of the cell cycle and other processes. Indeed, many proteins are 
likely to be involved in-proteolysis and cellular maintenance (as well as other 
processes). Such information is needed for the development of compounds to regulate 
the cell cycle and prevent or treat diseases'associated with abnormal cell proliferation. 

SUMMARY OF THE INVENTION 

The present invention provides compositions and methods for gene 
identification {e.g., F-box genes)., as well-as drug discovery, and assessment. The . 
present invention provides components of an E3 complex involved in ubiquitination. of 
cell cycle regulators and other proteins, as well as members of a class of proteins that 
directly function in recognition of ubiquitination targets. 

Thus, the present invention provides the function of a class of proteins referred 
to as F-box proteins in targeted ubiquitination. The present invention finds utility in 
methods for developing compounds that affect ubiquitination. The present invention 
also provides numerous hovel F-box containing mammalian genes whose encoded 
proteinsare contemplated" to function in processes including, but not- limited, to- - 
targeted ubiquitination of cellular proteins. 

The present invention also provides amino acid and DNA sequence information 
for eighteen novel F-box-containing human or mouse genes. As with Cdc4, Grrl, 
Skp2, and cyclin F, these novel F-box proteins have the capacity to associate with 
Skpl-and-to-simultaneousl-y-ih^ othetLP_rotein-pr otein 



WO 99/18989 



PCT/US98/21763 



interaction motifs encoded by regions of their genes other than the F-box. Thus, the 
present invention provides compositions and methods for determining the interaction of 
these proteins with other proteins. 

In one embodiment, the present invention provides an isolated polypeptide 
comprising at least one functionally active fragment of an F-box protein. In a 
preferred alternative embodiment, the F-box protein is mammalian, while in a 
particularly preferred embodiment, the F-box protein is human or murine. 

In another embodiment, the functionally active fragment comprises the amino 
acid sequence selected from the amino acid sequences set forth in SEQ ID NOS:I, 3, 
5, 9, 13, 17, 19, 25, 27, 41, 45, 47, 51, 53, 55, and 57, while in alternative 
embodiment, the functionally active fragment comprises the amino. acid .sequence 
selected from the amino acid sequences set forth in SEQ ID NOS:7, 11. 15. 21, 23, 
29, 31, 33, 35. 37.. 39, 43 ; and 49. 

The present invention also provides a purified antibody .which binds specifically 
to the isolated polypeptide encoding an F-box protein. In one embodiment, the 
antibody is monoclonal, while in another embodiment, the antibody is polyclonal. In 
another embodiment;" the present invention provides a purified antibody which 
specifically binds to a complex comprised of an F-box protein and an F-box protein 
target. In yet another embodiment, the present invention provides an antibody which 
specifically binds to a complex comprised of an F-box protein and Skpl: it is 
contemplated that the Skpl in the complex may be bound to another protein, but such 
binding is not required. 

The present invention also provides an isolated nucleotide sequence encoding at 
least one functionally active fragment of an F-box protein, wherein the nucleotide 
sequence encodes at least a portion of an F-box protein. In a preferred embodiment, 
the F-box protein is mammalian, while in particularly preferred embodiments, the F- 
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box protein is human or murine. In one embodiment, the isolated nucleotide sequence, 
comprises at least a portion of the sequence set forth in SEQ ID NOS:2, 4. 6, 10, 14, 
18, 20, 26, 28, 42, 48, 52, 54, 56, and 58. In another embodiment, the isolated 
nucleotide sequence comprises at least a portion of the sequence set forth in SEQ ID 
NO:8, 12, 16, 22, 24, 30, 32, 34, 36, 38, 40, 44, and 50. 

The present inyention also provides a vector comprising a nucleotide sequence, 
wherein the nucleotide sequence comprises the nucleotide sequence encoding at least 
one functionally active fragment of an F-box protein, wherein the nucleotide sequence 
encodes at least a portion of an F-box protein. In one preferred embodiment, the 
isolated nucleotide sequence comprises at least a portion of the sequence set forth in 
SEQ ID NOS:2, 4" ? 6, 10, 14, 18, 20, 26, 28, 42, 48, 52, 54, 56, and 58, while in 
another, preferred embodiment, the isp fated nucleotide sequence comprises at least a 
portion of the sequence set forth in SEQ ID NO:8, 12. 16. 22, 24, 30. 32. 34, 36, 38. 
40, 44, and 50. 

The present invention also provides a host cell transformed with at least one 
vector comprising a nucleotide sequence, wherein the nucleotide sequence comprises 
the nucleotide sequence encoding at least" one - functionally active fragment of ah F-box 
protein, wherein the nucleotide sequence encodes at least a portion of an F-box 
protein. In one preferred embodiment, the isolated nucleotide sequence comprises at 
least a portion of the sequence set forth in SEQ ID NOS;2, 4,' 6, 10. 14. 18, 20, 26. 
28. 42, 48, 52. 54, 56,, and 58. while in another preferred embodiment, the isolated 
nucleotide sequence comprises at least a portion of the sequence set forth in SEQ ID 
NO:8. 12, 16. 22. 24, 30. 32. 34, 36. 38, 40, 44, and 50. 

The present invention also provides an isolated nucleotide sequence encoding 
the amino acid sequence selected from group consisting of SEQ ID NOS:l, 3. 5, 7, 9, 
11; 13,15 ,17, 19, 21, 23, 25, 27, '29. 31, 33, 35, 37. 39, 41. 43, 45, 47. 49, 51, 53, 
55, and 57. In one embodiment, the present invention provides a vector comprising an 
isolated nucleotide sequence encoding the amino acid sequence selected from group 
consisting of SEQ ID NOS:l, 3, 5,7, 9, 1 1, 13,15 ,17, 19, 21, 23, 25. 27, 29, 31, 33, 
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35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, and 57. In another embodiment, the preseat 
invention provides a host cell transformed with this vector. 

The present invention further provides a polynucleotide sequence comprising at 
least fifteen nucleotides, which hybridizes under stringent conditions to at least a 
portion of a polynucleotide sequence, wherein the polynucleotide sequence is selected 
from the polynucleotide sequences set forth in SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 
18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, and 58. 

The present invention also provides methods for detection of polynucleotides 
encoding F-box protein in a biological sample comprising the steps of: hybridizing at 
least a portion of the polynucleotide encoding an F-box protein, to nucleic acid 
material of a biological sample, thereby forming a hybridization complex; and 
detecting the hybridization complex, wherein the presence of the complex correlates 
with the presence of a polynucleotide encoding F-box protein in the biological sample. 
In one embodiment of the method, prior to hybridization, the nucleic acid material of 
the biological sample is amplified by the polymerase chain reaction. 

The present invention also provides methods for the detection of F-box protein 
targets comprising tlTe steps of: providing an F-box protein, and a sample suspected of 
containing an F-box protein target; exposing the F-box protein to the sample under 
conditions such that the F-box protein binds to the F-box protein target to form an F- 
box protein and target complex; and detecting the F-box protein and target complex. 
In one embodiment of the method, the box protein target is selected from the group 
consisting of cyclins. cyclin-dependent kinases, and IkB. An alternative embodiment 
further comprises the step of analyzing said F-box protein and target complex, wherein 
the analyzing comprises* observing the F-box protein and target complex for 

degradation of the F-box protein-target. - In another embodiment. -the-method further t - 

comprises the step of exposing the F-box protein and F-box protein target to an F-box 
protein antagonist. In yet another embodiment of the method, the F-box protein 
antagonist prevents the formation of the F-box protein and the target complex. 

The present invention also provides methods for the detection of an F-box 
protein and Skpl complex, comprising the ste ps of: providin g an F-box protein, and 
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Skpl; exposing the F-box protein to Skpl under conditions such that the F-box protein 
binds to Skpl to form an F-box protein and Skpl complex; and detecting the F-box 
protein and Skpl complex. One embodiment of the method further comprises the step 
of exposing the F-box protein and Skpl to an F-box protein antagonist. In yet 
another embodiment of the method, the' F-box protein antagonist prevents the 
formation of the F-box protein and Skpl complex. 

The present invention also provides methods and compositions useful to 
determine the complexity and diversity of mammalian F-box proteins, as well as the 
identity of F-box proteins from . various species, the protein-protein interaction domains 
involved, the proteolytic pathways, and regulatory pathways. Indeed, the present 
invention provides methods and compositions to identify the functions and 
ubiquitination targets of these and other F-box containing proteins. 

However, the present invention is not limited to F-box proteins involved in 
ubiquitination. Thus,, the' function of F-box proteins is not necessarily iimited to 
ubiquitination, and the present invention provides' the methods and compositions to 
make this determination. It is contemplated that additional F-box containing genes 
will be discovered through the use .of two-hybrid screens with* Skpl or ubiquitination 
targets as the two-hybrid "bait" (e.g., as described in the Example 6). It is also 
contemplated that additional F-box genes will be discovered through sequencing of the 
mammalian genome and sequence analysis, to determine the homology with existing 
F-box proteins, such as those identified in the present invention. 

The present invention also provide compositions and methods for development 
of drugs that disrupt at least one pathway in which F-box proteins, function, and are 
required for biological and/or biochemical processes. 

The present invention also provides methods and- compositions to identify 
and/or investigate cell cycle regulators, transcription regulators, proteins involved in 
DN A replication, and other cellular regulatory proteins. It is* further contemplated that 
the. present invention finds use in elucidating inflammatory response and infectious 
disease processes involving protein degradation, as well as development of compounds 
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that control (i.e., either enhance or retard) protein degradation, as appropriate to 
ameliorate the effects of the inflammatory response or disease process. 

The present invention also provides methods and compositions for identifying 
and investigating the function of protein targets whose abundance is altered in disease, 
as well as for detection, identification, and characterization of mutations in F-box 
genes through various methods, including, but not limited sequence analysis, Southern 
blot analysis of DNA, etc. Furthermore, the present invention also finds use in 
assessing alterations in cellular protein abundance due to overexpression of particular 
F-box proteins. It is contemplated that such alterations are associated with particular 
diseases. The present invention also finds use in determination of overexpression 
caused by gene amplification in DNA samples from diseased tissue or individuals 
through such methods as Southern analysis using a particular F-box gene as probe. 

It is also contemplated that targets of novel human F-box proteins will be 
determined by those experienced in the art by approaches including, but not limited to 
two-hybrid library screens, immunoprecipitation analysis followed by immunoblotting 
with antibodies against candidate targets, peptide mapping, mass spectral analysis, 
peptide sequencing"," and/or by -screening lambda based expression libraries with F-box 
protein probes. 

In addition, the present invention finds use in engineering F-box proteins to 
artificially recruit particular proteins into an E3 complex for ubiquitination. Thus, it is 
clear that the present invention provides methods and compositions for detailed 
investigation of F-box proteins, as well as proteins that associate with F-bqx proteins. 
Furthermore, the present invention thereby provides methods and compositions for the 
detection and analysis of abnormalities in proteolytic functions, as well as methods and 
compositions for the development of compounds suitable for use in ameliorating such " 
abnormalities. 

The present invention further provides a method for the detection of one or 
more NF-kB regulatory factors comprising the steps of: providing a slimb protein, and 
a sample suspected of containing one or more NF-jcB regulatory factors; and exposing 
the-slimb-protein^ 
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the one or more NF-kB regulatory factors to form a slimb/regulatory factor complex. . 
In some preferred embodiments, the method further comprises the step of detecting the 
slimb/regulatory factor complex. In other embodiment, the method further comprises 
the step of observing the slimb/regulatory factor complex for degradation of the one or 
5 more NF-kB regulatory factors. In yet other embodiments, the method further 

comprises the step of exposing the slimb protein and one or more NF-kB regulatory, 
factors to an F-box protein antagonist. In particular embodiments, the F-box protein 
antagonist prevents the formation of the slimb/regulatory factor complex. 

The present invention also provides a method for the detection of a siimb 
10 protein complex, comprising the steps of: providing a slimb protein and a sample 

suspected of containing one or more proteins capable of. forming a complex with the 
slimb protein: and exposing the slimb protein to the one or more proteins capable of 
forming a complex with the slimb protein under conditions such that the slimb protein 
binds to the one or more proteins capable of forming a complex with the slimb protein 
15 ' to form a slimb protein complex. In preferred embodiments, the method further 

comprises the step of detecting the slimb protein complex. In some embodiment, step 
b) of the method further comprises exposing the slimb protein and the one or more 
proteins capable of forming a complex with the slimb protein to an F-box protein 
antagonist. In particularly preferred embodiments, the F-box protein antagonist 
20 prevents the formation of the slimb protein complex. 

. DESCRIPTION OF THE FIGURES 

Unless otherwise indicated, a "P" enclosed within a circle indicates that the 
protein associated with the symbol is phosphorylated. 

Figure 1 shows the" assembly of amultiprotein complex containing Cdc34,.. . _ 
25 Cdc53. SkpL and Cdc4, with the three panels showing the enhancement of the 
formation of a Cdc53/Cdc4 complex by Skpl. Panel A shows the results of 
immunoprecipitation with Myc tag on Cdc53 (Cdc53 M ) using anti-Myc antibodies. 
Panel B shows the immunoprecipitation results with a Flag tag on Cdc4 (Cdc4 F ). 
p - ffrt - el - c - shows - that _ Sfc ^^ -sjrrwltaneously. 

- 8 - 
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Figure 2A shows an SDS-PAGE analysis of purified Clnl HA/Gst- 
Cdc28HA/Cksl. 

Figure 2B is an autoradiograph showing the phosphorylation of Sicl by 
Clnl/Cdc28 complexes in vitro. 

Figure 2C shows immunoblot results indicating that phosphorylation of Sicl is 
required for its association with Cdc34/Cdc53/Skpl/Cdc4 complexes. 

Figure 2D shows immunoblot results indicating that association of 
phosphorylated Sicl with Cdc4 is enhanced by Skpl. 

Figure 2E shows immunoblot results indicating that association of ; 
phosphorylated Sicl with Skpl requires the WD-40 repeats of Cdc4. 

* Figure 3A shows the immunoblot results demonstrating that, phosphorylated 
Sicl is ubiquitinated in vivo and in vitro with purified Cdc34 E2 and 
Cdc53/Skpl/Cdc4 complexes. 

Figure 3B shows immunoblot results for anti-Cdc53 M immune complexes tested 
against Cdc53 M /Skpl, Cdc53 M /Skpl/Cdc4, and supplemented with ATP ? ubiquitin, 
human EL Cdc34 purified from £. cdli, and either unphosphorylated or phosphorylated 

Sicl complexes. " : - - - . 

Figure 3C shows immunoblot results for anti-Skpl'' immune complexes tested 
with .Skpl 7Cdc53 M /Cdc4 ? Skpl F /Cdc4, and Skpl F /Cdc53 M 

Figure 3D shows immunoblot results that indicate ubiquitination of Sicl does, 
not require that Cln/Cdc28 be present in the ubiquitination reaction nor that Sicl be 
associated with Clb5/Cdc28. 

Fieure 3E shows immunoblot results that Clb5/Cdc28-phospHorylated Sicl is a 
substrate for ubiquitination by Cdc34. 

Figure 4 A shows immunoblot results indicating that Grrl can associate with 

Skpl and Cdc53. 

Figure 4B shows an autoradiograph indicating that phosphorylated Sicl 
associates with Cdc4 but not Grrl -containing complexes. 

•Figure 4C shows an immunoblot indicating that Cdc4 ? but not Grrl, supports 
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Figure 4D shows an immunoblot used to verify the presence of reaction 
components derived from immunoprecipitation (the blot used for ubiquitination assays 
was reprobed to detect GrrlGlO, Cdc53 M , and Cdc4). 

Figure 5A is an autoradiograph showing differential recognition of Sicl and 

Cln proteins by Grrl and Cdc4. 

Figure 5B shows an immunoblot verifying the presence of Cdc4, GrrlGlO, 

Cdc53 M , and Skpl F . 

Figure 5C is an immunoblot indicating that phosphorylation of Cln is required 
for the association of Clnl/Cdc28 complexes with Grrl. 

Figure 5D is an autoradiograph showing that purified Skpl/Cdc53/Grrl 
complexes are not sufficient for Clnl ubiquitination by Cdc34 in vitro. 

Figure 5E is an immunoblot showing that phosphorylated Clnl is ubiquitinated 

in a fractionated yeast extract system. 

Figure 6A is a schematic showing that phosphorylation of substrates through 
protein kinase signalling pathways is . required for recognition by F-box receptor 
proteins. 

Figure 6B fe'a schematic showing that distinct F-box complexes may regulate 
different biological processes through selective recruitment of substrates. Hypothetical 
FEC configurations are shown together with the signals that are being sensed, the 
corresponding substrates and the physiological consequences of complex function. 

Figure 6C is a schematic showing the interplay between protein kinase and the 
SCF pathway in the Gl to S-phase transition in 5. cerevisiae. In this Figure, 
. perpendicular, bars indicate inhibitory events. 

Figure 7 shows the alignment of various F-box proteins provided in the present 

invention. 

Figure 8 shows immunoblotting results demonstrating that Skpl associates with 
phosphorylated IkB but not unphosphorylated IkB. 

Figure 9 shows the interaction between various F-box proteins and 
phosphorylated and unphosphorylated IkB. Schematic representations of the F-box 

— proteins- are-provided-with ^'-representing Evbox-sequences, : 

- 10 - . 
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Figure 10A and 10B show lysates from the indicated transfections that were , 
subjected to immunoblotting using the indicated antibodies. 

Figure 11A and 1 IB show additional immunoprecipitation/western blotting 
experiments using the indicated transfections and antibodies. 
5 Figure 12 shows immunoblotting results with the indicated antibodies for 

phosphorylation-specific interaction of SCF slimb complexes with IkB peptide 
sequences. 

DEFINITIONS 

To facilitate understanding of the invention, a number of terms are defined 

10 below. 

As used herein, the term "F-box proteins" refers to the amino acid sequences of 
substantially purified proteins involved in proteolysis, including but not limited to 
proteins involved in the ubiquitin-ligase complex obtained from any species, including 
bovine, ovine, porcine, murine, equine, and human, from any source whether natural, 

15 . synthetic, semi-synthetic, or recombinant. The F-box is a sequence of 35-45 amino 
acids and allows the~F-box proteins "to enter -into complexes with Skpl. Thus, the F- 
box proteins bind Skpl, and contain a motif that displays sequence similarity to Grrl 
and Cdc4. This conserved structural motif is included in the sequence alignments 
shown in Figure 7 (i.e.. the amino acid residues that are shared by the F-box proteins 

20 shown). However, it is not intended that the term be limited to the exact sequences set 

forth in Figure 7. In some embodiments, the F-box proteins further comprise 
additional motifs, in particular motifs involved in protein-protein interaction. These 
additional motifs included, but are not limited to leucine-rich repeats, and WD-40. In 
preferred embodiments," the F-box- protein is mammalian, while in particularly 

25 preferred embodiments, the F-box protein is human or murine. 

As used herein, the term "F-box target" refers to any moiety that is recognized 
by at least one F-box containing protein. It is intended that the term encompass such 
proteins as the cyclins {e.g.. A, D, and E), as well as cyclin kinase inhibitors {e.g., 
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pin and IkB, as well as other proteins. It is not intended that the term be limited to, 
any particular protein or compound. 

As used herein, the term "multiprotein complex" refers to complexes 
comprising more than one protein. It is intended that the term encompass complexes 
5 with any number of proteins. In preferred embodiments, the proteins comprising a 

multiprotein complex function cooperatively. For example, in particularly preferred 
embodiments of the present invention, Cdc34, Cdc53, Skpl , and Cdc4 comprise a 
multiprotein complex. It is also intended that the term encompass complexes 
comprising Skpl, any of the amino acid sequences set forth in Table 2 or Table 4, and 
10 a Cdc53 homolcg. In preferred embodiments, the Cdc53 homolog in such multiprotem 

complexes comprises human Cul proteins (e.g.. Cul 1 through 5). as well as murine 
Cul proteins. . It is also intended that this term encompass complexes comprised of an 
F-box protein and its target protein (i.e.. an F-box target protein). 

The term "modulate;- as used herein, refers to a change or an alteration in the 
15 biological activity of an F-box protein (e.g., mammalian F-box. proteins). Modulation 

may be an increase or a decrease, in protein activity, a change in binding 
characteristics, or any other change in the biological, functional, or immunological 

properties of an F-box protein. 

The term "mimetic," as used herein, refers to a molecule, the structure of which 
20 . is developed from knowledge of the structure of an F-box protein, or portions thereof 
and. as such, is able to effect some or all of the actions of F-box proteins and/or F-box 

protein-like molecules. 

The term "antagonist" refers to molecules or. compounds which inhibit the 
action of a composition (eg., an F-box protein). Antagonists may or may not be 

25~ homologous to the targets of these compositions in respect to conformation, charge or 
other characteristics. In particularly preferred embodiments, antagonists prevent the 
functioning of F-box proteins. It is contemplated that antagonists may prevent binding 
of an F-box protein and its target(s). It is also contemplated that antagonists prevent 
or alter the binding of an F-box protein and Skpl. However,.it is not intended that the 

-~30 terln~b^iim(ea"to^-particulaMite-of-function 
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The term "derivative," as used herein, refers to the chemical modification of a. 
nucleic acid encoding an F-box protein (in particular, mammalian F-box proteins), or 
the encoded F : box protein. Illustrative of such modifications would be replacement of 
hydrogen by an alkyl, acyl, or amino group. A nucleic acid derivative would encode a 
polypeptide which retains essential biological characteristics of the natural molecule. 

A "variant" of an F-box protein, as used herein, refers to an amino acid 
sequence that is altered by one or more amino acids. The variant may have 
"conservative" changes, wherein a substituted amino acid has similar structural or 
chemical properties (e.g., replacement of leucine with isoleucine). More rarely, a 
variant may have "nonconservative" changes (e.g., replacement of a glycine with a 
tryptophan). Similar minor variations may also include amino acid deletions or 
insertions, or both. Guidance in determining which amino acid residues may be 
substituted, inserted, or deleted without abolishing biological or immunological activity 
may be found using computer programs well known in the art. for example, 

DNAST-AR software. 

. "Alterations" in the polynucleotide of for example, SEQ ID NO:4, as used 

' herein, comprise any alteration- in the sequence of polynucleotides encoding human Fl 
Alpha F-box protein, including deletions, insertions, and point mutations that may be 
detected using hybridization assays. Included within this definition is the detection of 
alterations to the genomic DNA sequence which encodes an F-box protein (e.g.," by 
alterations in the pattern of restriction fragment length polymorphisms) capable of 
hybridizing to a particular sequence, the inability of a selected fragment to hybridize to 

* a sample of genomic DNA (e.g.. using allele-specific oligonucleotide probes), and 
improper or unexpected hybridization, such as hybridization to a locus other than the 
normal chromosomal locus for the "polynucleotide sequence encoding an F-box .protein 

- (e.g.. using fluorescent in situ hybridization [FISH] to metaphase chromosomes 
spreads). 

; A "consensus gene sequence" refers to a gene sequence which is derived by 
comparison of two or more gene sequences and which describes the nucleotides most 
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often present in a given segment of the genes; the consensus sequence is the canonical ^ 
sequence. In some embodiments, "consensus," refers to a nucleic acid sequence which 
has been resequenced to resolve uncalled bases, or which has been extended using any 
suitable method known in the an, in the 5* and/or the 3' direction and resequenced, or 
5 which has been assembled from the overlapping sequences of more than one clone 

using any suitable method known in the art, or which has been both extended and 
assembled. 

The term "sample," as used herein, is used in its broadest sense. The term • 
encompasses biological sample(s) suspected of containing nucleic acid encoding F-box 
10 proteins or fragments thereof, and may comprise a cell, chromosomes isolated from a 

cell (e.g..- a spread of metaphase chromosomes), genomic DNA (iri. solution or bound 
to a solid support such as for Southern analysis), RNA (in solution or bound to a solid 
support such as for northern analysis), cDNA (in solution or bound to a solid support), 
an extract from cells or a tissue, and the like. 
l5 As used herein the terms "protein" and "polypeptide" refer to compounds 

. comprising amino acids joined via peptide bonds and are used interchangeably. 

The terms "gene sequences" or "native gene sequences" are used to indicate 
DNA sequences encoding a particular gene which contain the same DNA sequences as 
found in the gene as isolated from nature. In contrast, "synthetic gene sequences" are 
2D DNA sequences which are used to replace the naturally occurring DNA sequences 

when the naturally occurring sequences cause expression problems in a given host cell. 
For example, naturally-occurring DNA sequences encoding codons which are rarely 
. used in a host cell may be replaced (e.g.. by site-directed mutagenesis) such that the 
synthetic DNA sequence represents a more frequently used codon. The native DNA 
25 sequence and the synthetic DNA sequence will preferably encode, the same jimiho acid 

sequence. 

As used herein, the term "gene" means. the deoxyribonucleotide sequences 
comprising the coding region of a structural gene and the including sequences located 
adjacent to the coding region on both the 5, and 3, ends for a distance of about 1 kb 
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The sequences which are located 5' of the coding region and which are present on the, 
mRNA are referred to as 5' non-translated sequences. The sequences which are 
located 3' or downstream of the coding region and which are present on the mRNA . 
are referred to as 3' non-translated sequences; these sequences. The term "gene- 
encompasses both cDNA and genomic forms of a gene. A genomic form or clone of a 
gene contains the coding region interrupted with non-coding sequences termed 
"introns" or "intervening regions" or "intervening sequences." Introns are segments of 
a gene which are transcribed into nuclear RNA (hnRNA); introns may contain 
regulatory elements such as enhancers. Introns are removed or "spliced out" from the 
nuclear or primary transcript; introns therefore are absent in the messenger RNA 
(mRNA) transcript. The mRNA functions during translation to specify the sequence or 
order of amino acids in a nascent polypeptide. 

In addition to containing introns, genomic forms of a gene may also include 
sequences located on both the 5' and 3' end of the sequences which are present on the 
RNA transcript. These sequences are referred to as "flanking" sequences or regions 
(these flanking sequences are located 5' or 3' to the non-translated sequences present 
on the mRNA transcript). The- 5' flanking region may contain regulatory sequences 
such as promoters and enhancers which control or influence the transcription of the 
gene. The 3 : flanking region may contain sequences which direct the termination of 
transcription, post-transcriptionat cleavage and polyadenylation. 

As used herein, the term "structural gene" refers to a DNA sequence coding for 
RNA or a protein. In contrast, "regulatory genes" are structural genes which encode 
products. which control the expression of other genes {e.g.. transcription factors). 

As used herein the term "coding region" when used in reference to structural 
gene refers to the nucleotide sequences which encode the amino acids found in the 
nascent polypeptide as a result of translation of a mRNA molecule. The coding region 
is bounded, in eukaryotes, on the 5' side by the nucleotide triplet "ATG" which 
encodes the initiator methionine and on the 3' side by one of the three triplets which 
specify stop codons (i.e., TAA, TAG, TGA). 
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The term "portion," as used herein, with regard to a protein (as in "a portion of 
a given protein") refers to fragments of that protein. The fragments may range in size 
from four amino acid residues to the entire amino acid sequence minus one amino 
acid. Thus, a protein "comprising at least a portion of the amino acid sequence of 
SEQ ID NO:3" encompasses the full-length human Fl protein, and fragments thereof. 

"Nucleic acid sequence" as used herein refers to an oligonucleotide, nucleotide, 
or polynucleotide, and fragments or portions thereof, and to DNA or RNA of genomic 
or synthetic origin which may be single- or double-stranded, and represent the sense or 
antisense strand. Similarly, "amino acid sequence" as used herein refers to an 
oligopeptide, peptide, polypeptide, or protein sequence, and fragments or portions 
thereof, and to naturally occurring or synthetic molecules. 

A "composition comprising'a given polynucleotide sequence" as used herein 
refers broadly to any composition containing the given polynucleotide sequence. The 
composition may comprise an aqueous solution. Compositions comprising 
polynucleotide sequences encoding F-box proteins or fragments thereof, may be 
employed as hybridization probes.. .In this case, the F-box-encoding polynucleotide 
sequences are typically employed in an aqueous solution containing salts (e.g., NaCl), 
determents (e.g., SDS) and other components (e.g., Denhardt's solution, dry milk, 
salmon sperm DNA, etc.). 

Where "amino acid sequence" is recited herein to refer to an amino acid 
sequence of a naturally occurring protein molecule, "amino acid sequence" and like 
terms, such as "polypeptide" or "protein" are not meant to limit the amino. acid 
sequence to the complete, native amino acid sequence associated with the recited 

. protein molecule. 

A "deletion." as used herein, refers to a change in either amino acid or 
nucleotide sequence in which one or more amino acid or nucleotide residues, 
respectively, are absent. 

An "insertion" or "addition," as used herein, refers to a change in an amino acid 
or nucleotide sequence resulting in the addition of one or more amino acid or 
— nucl'eotrde-residuesrrespectively-as cDmpared-to-the naturally-occumng_mole.cule. 
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A "substitution," as used herein, refers to the replacement of one or more 
amino acids or nucleotides by different amino acids or nucleotides, respectively. 

The term "biologically active," as used herein, refers to a protein having 
structural, regulatory, or biochemical functions of a naturally occurring molecule. 
Likewise, "immunologically active" refers to the capability of the natural, recombinant, 
or synthetic F-box proteins, or any oligopeptide thereof, to induce a specific immune 
response in appropriate animals or cells and' to bind with specific antibodies. 

As used" herein, the term "purified" or "to purify" refers to. the removal of 
contaminants from a sample. For example, proteins of interest are purified by 
removal of contaminating proteins; they are also purified by the removal of 
substantially all proteins that are not of interest. The removal of non-immunoglobulin 
proteins and/or the Temovalof. immunoglobulins..** do. not bind protein results in an 
increase in the percent of protein of interest-reactive immunoglobulins in the sample. 
In another example, recombinant polypeptides are expressed in bacterial host cells and 
the polypeptides are purified by the removal of host cell proteins: the percent of 
recombinant polypeptides is thereby increased in the sample. 

The term "substantially purified," 'as used herein, refers to nucleic or amino 
acid sequences that are removed from their natural environment, isolated or separated, 
and are at least 60% free, preferably 75% free, and most preferably 90% free from 
other components with which they are naturally associated. 

The term "recombinant DNA molecule" as used herein refers to a DNA 
molecule which is comprised of segments of DNA joined together by means of 
molecular biological techniques. 

The term "recombinant protein" or "recombinant polypeptide" as used herein 
refers to a protein molecule which is expressed from a recombinant DNA molecule. 
The term "native protein" as used herein refers to a protein which is isolated from a 
natural source as opposed to the production of a protein by recombinant means. 

As used herein, the term "overproducing" is used in reference to the production 
of polypeptides in a host cell, and indicates that the host cell is producing more of the 

poly^eptide_by_v|nu^onheJii^du^ 
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polypeptide than would be expressed by the host cell absent the introduction of these, 
nucleic acid sequences. To allow ease of purification of polypeptides produced in a 
host cell it is preferred that the host cell express or overproduce the polypeptide at a 
level greater than 1 mg/liter of host cell culture. 

"A host cell capable of expressing a recombinant protein as a soluble protein at 
a level greater than or equal to X milligrams per 1 OD of cells per liter" is a host cell 
that produces X milligrams of recombinant protein per liter of culture medium 
containing a density of host cells equal to 1 OD 600 . The amount of recombinant 
■ protein present per OD per liter is determined by quantitating the amount of 
recombinant protein recovered following affinity purification. 

"A host ceil capable of secreting a recombinant protein into the culture 
supernatant at a level greater than or equal to 10 mg recombinant protein per 1 OD of 
cells per liter" refers to a host cell that secretes a recombinant protein into the culture 
supernatant (i.e.. the medium, such as LB broth," used to grow the host cell) at a level 
15 greater than or equal to 10 mg recombinant protein per liter of medium containing a 

concentration {i.e., density) of host cells equal to 1 OD S00 . The host cells may be 
grown in shaker flasks (approximately T liter culture medium) or in fermentation tank 
(approximately 10 liters culture medium) and the amount of recombinant protein 
secreted into the culture supernatant may be determined using a quantitative ELISA 
20 assay. 

As used herein, the term "fusion protein" refers to a chimeric protein containing 
the protein of interest (i.e., a ubiquitination complex and/or fragments thereof) joined 
to an exogenous protein fragment (the fusion partner which consists of a non- 
ubiquitination complex protein). The fusion partner may enhance solubility of the 
25 protein as expressed in a host cell,, may provide an "affinity tag" to allow purification 

of the recombinant fusion protein from the host cell or culture supernatant, or both. If 
desired, the fusion protein may be removed from the protein of interest prior to 
immunization by a variety of enzymatic or chemical means known to the art. 

As used herein, the term "affinity tag" refers to such structures as a "poly- 

"30 histiaine-tract"-or- M pol^^^^^ 
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facilitates the purification of a recombinant fusion protein from a host cell, host cell- 
culture supernatant, or both. As used herein, the term "flag tag" refers to short 
polypeptide marker sequence useful for recombinant protein identification and 
purification. 

As used herein, the terms "poly-histidine tract" and "poly-histidine tag," when 
used in reference to a fusion protein refers to the presence of two to ten histidine (or 
more) residues at either the amino- or carboxy-terminus of a protein of interest. A 
poly-histidine tract of six to ten residues is preferred. The poly-histidine tract is also 
defined functionally as being a number of consecutive histidine residues added to the 
protein of interest which allows the affinity purification of the resulting fusion protein 
on a nickel-chelate or IDA column. 

As used herein, the term "chimeric protein" refers to two or more coding 
sequences obtained from different genes, that have been cloned together and that, after 
translation, act as a single polypeptide sequence. Chimeric proteins are also referred to 
as "hybrid proteins." As used herein, the term "chimeric protein" refers to coding 
sequences that are obtained from different species of organisms, as well as coding 
sequences that are "obtained from -the same species of organisms. 

As used herein, the term "protein of interest" refers to the protein whose 
expression is desired within the fusion protein, in a fusion protein, the protein of 
interest will be joined or fused with another protein or protein domain, the fusion 
partner, to allow for enhanced stability of the protein of interest and/or ease of 
purification of the fusion protein. 

As used herein "soluble" when in reference to a protein produced by 
recombinant DNA technology in a host cell, is a protein which exists in solution in the 
cytoplasm of the host, cell; if the protein contains a signal sequence, the soluble protein- 
is secreted into the culture medium of eukaryotic cells capable of secretion or by 
bacterial hosts possessing the appropriate genes. In contrast, an insoluble protein is 
one which exists in denatured form inside cytoplasmic granules (/.e., inclusion bodies) 
in the host cell. High level expression (i.e., greater than 1 mg recombinant - 
"pi^eiMit^ro^ 
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being found in inclusion bodies in the host cells. A soluble protein is a protein which 
is not found in an inclusion body inside the host cell or is found both in the cytoplasm.* 
and in inclusion bodies and in this case the protein may be present at high or low 
levels in the cytoplasm. 

"Peptide nucleic acid" as used herein, refers to a molecule which comprises an 
oligomer to which an amino acid residue, such as lysine, and an amino group have 
been added. These small molecules, also designated anti-gene agents, stop transcript 
elongation by binding to their complementary strand of nucleic acid (Nielsen, P.E. et. 
al, Anticancer Drug Des., 8:53-63 [1993]). 

The term "hybridization" as used herein, refers to any process by which a 
strand of nucleic acid binds with a complementary strand through base, pairing. 
Hybridization and the strength of hybridization (i.e., the strength of the association 
between the nucleic acids) is impacted by such factors as the degree of complementary 
between the nucleic acids, stringency of the conditions involved, the T m of the formed 
hybrid, and the G:C ratio within the nucleic acids. 

As used herein, the term "T m " is used in reference to the "melting temperature." 
The melting temperature is the temperature at which a population of double-stranded - 
nucleic acid molecules becomes half dissociated into single strands. The equation for 
calculating the T m of nucleic acids is well known in the art. As indicated by standard 
references, a simple estimate of the T m value may be calculated by the equation; T m = 
81.5 -r 0.4 1(% G + C), when a nucleic acid is in aqueous solution at 1 M NaCl (See 
e.g., Anderson and Young, Quantitative Filter Hybridization, in Nucleic Acid 
Hybridization [1985]). Other references include more sophisticated computations 
which take structural as well as sequence characteristics into account for the 
calculation of T m . 

The term "hybridization complex," .as used herein, refers to a complex formed 
between two nucleic acid sequences by virtue of the formation of hydrogen binds 
between complementary G and C bases and between complementary A and T bases; 
these hydrogen bonds may be further stabilized by base stacking interactions. The two 
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complementary nucleic acid sequences hydrogen bond in an antiparallel configuration. 
A hybridization complex may be formed in solution (e.g., C 0 t or R^t analysis) or 
between one nucleic acid sequence present in solution and another nucleic acid 
sequence immobilized on a solid support (e.g., membranes, filters, chips, pins or glass 
slides to which cells have been fixed for in situ hybridization). 

The terms "complementary" or "complementarity" as used herein, refer to the 
natural binding of polynucleotides under permissive salt and temperature conditions by 
base-pairing: For example, for the sequence "A-G-T" binds to the complementary 
sequence "T-C-A". Complementarity between two single-stranded molecules may be 
"partial", in which only some of the nucleic acids bind, or it may be complete when 
total complementarity exists between the single stranded molecules. The degree of 
complementarity between nucleic acid strands has significant effects on the efficiency 
and streneth of hybridization between nucleic acid strands. This is of particular 
importance in amplification reactions, which depend upon binding between nucleic 
acids strands. 

The term "homology," as used herein, refers to a degree of complementarity. 
There may be partial homology or complete homology (/.<?.," identity). A partially 
complementary sequence is one that at least partially inhibits an identical sequence . 
from hybridizing to a target nucleic acid; it is referred to using the functional term 
"substantially homologous." The inhibition of hybridization of the completely 
complementary sequence to the target sequence may be examined using a hybridization 
assay (Southern or Northern blot, solution hybridization and the likeY under conditions 
of low stringency. A substantially homologous sequence or probe will compete for 
and inhibit the binding {i.e., the hybridization) of a completely homologous sequence 
or probe to the target sequence "under conditions of "low stringency. This is not to say- 
that conditions of low stringency are such that non-specific binding is permitted; low 
stringency conditions require that the binding of two sequences to one another be a 
specific (i.e.. selective) interaction. The absence of non-specific binding may be tested 
by the use of a second target sequence which lacks even a partial degree of 
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complementarity (e.g., -less than about 30% identity); in the absence of non-specific * 
binding, the probe will not hybridize to the second non-complementary target 
sequence. When used in reference to a single-stranded nucleic acid sequence, the term 
"substantially homologous' 1 refers to any probe which can hybridize (i.e., it is the 
complement of) the single-stranded nucleic acid sequence under conditions of low 
stringency as described. 

As known in the art, numerous equivalent conditions may be employed to 
comprise either low or high stringency conditions. Factors such as the length and 
nature (DNA, RNA, base composition) of the sequence, nature of the target (DNA, 
RNA, base composition, presence in solution or immobilization, etc.), and the 
concentration of the salts and other components {e.g., the presence or absence of 
formamide,.dextran sulfate and/or. polyethylene glycol) are considered and the 
hybridization solution may be varied to generate conditions of either low or high 
stringency different from, but equivalent to. the above listed conditions. 

As used herein the term "stringency" is used in reference to the conditions of 
temperature, ionic strength, and the. presence of other compounds such as organic - 
solvents, under which nucleic acid hybridizations are conducted. With "high 
stringency" conditions, nucleic acid base pairing will occur only between nucleic acid 
fragments that have a high frequency of complementary base sequences. Thus, 
conditions of "weak" or "low" stringency are often required with nucleic acids that are 
derived from organisms that are genetically diverse, as the frequency of 
complementary sequences is usually less. 

Low stringency conditions comprise conditions equivalent to binding or 
hybridization at 42°C in a solution consisting of 5X SSPE (43.8 g/1 NaCL 6.9 g/1 
NaH 2 P0 4 *H 2 0 and 1785 g/TEDTA, pH adjusted to 7.4 with NaOH) ? 0.1% SDS, 5X 
Denhardt's reagent (SOX Denhardt's contains per 500 ml: 5 g Ficoll (Type 400, 
Pharmacia), 5 g BSA [Fraction V; Sigma]) and 100 ^ig/ml denatured salmon sperm 
DNA followed by washing in a solution comprising 5X SSPE, 01% SDS at 42°C 
when a probe of about 500 nucleotides in length is employed. 
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The art knows well that numerous equivalent conditions may be employed to 
comprise low stringency conditions; factors such as the length and nature (DNA. RNA," 
base composition) of the probe and nature of the target (DNA, RNA, base 
composition, present in solution or immobilized, etc.) and the concentration of the salts 
and other components (e.g., the presence or absence of formamide, dextran sulfate, 
polyethylene glycol) are considered and the hybridization solution may be varied to 
- generate conditions of low stringency -hybridization different from, but equivalent to, 
the above listed conditions. In addition, the art lenows conditions which promote 
hybridization under conditions of high stringency (e.g.,. increasing the temperature of 
the hybridizaiion and/or wash steps, the use of formamide in the hybridization 
solution, etc.). 

The term "antisense/' as used herein, refers to nucleotide sequences which are 
complementary to a specific DNA or RNA sequence. The term "antisense strand' 1 is 
used in reference to a nucleic acid strand that is complementary to the "sense" strand. 
Antisense molecules may be produced by any method, including synthesis by ligating 
the gene(s) of interest in a reverse orientation to a viral promoter which permits the 
synthesis of a complementary* strand. Once introduced into a cell, this transcribed 
strand combines with natural sequences produced by the cell to form duplexes. These 
duplexes then block either the further transcription or translation. In this manner, 
mutant phenotypes may be generated. The designation "negative" is sometimes used in 
reference to the antisense strand, and "positive" is sometimes used in reference to the 
sense strand. 

The term also is used in reference to RNA sequences which are complementary 
to a specific RNA sequence (e.g., mRNA). Included within this definition are 
antisense RNA ("asRNA") molecules involved in gene regulation by bacteria. 
Antisense RNA may be produced by any method, including synthesis by splicing the 
gene(s) of interest in a reverse orientation to a viral promoter which permits the 
synthesis of a coding strand. Once introduced into an embryo, this transcribed strand 
combines with natural mRNA produced by the embryo to form duplexes. These 
-duple-xes-then^lock-eitte^ 
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this manner, mutant phenotypes may be generated. The term "antisense strand" is used 
in reference to a nucleic acid strand that is complementary to the "sense" strand. The 
designation. (-) (i.e., "negative") is sometimes used in reference to the antisense strand 
with the designation (+) sometimes used in reference to the sense {i.e.. "positive") 
strand. 

A gene may produce multiple RNA species which are generated by differential 
splicing of the primary RNA transcript. cDNAs that are splice variants of the same 
gene will contain regions of sequence identity or complete homology (representing the 
presence of the same exon orportion of the same exon on both cDNAs) and regions of 
complete non-identity (for example, representing the presence of exon "A" on cDNA 1 
wherein cDNA 2 contains exon "B" instead). Because the two cDNAs contain regions 
of sequence identity they will both hybridize to a probe derived from the entire gene 
or portions of the gene containing sequences found on both cDNAs; the two splice 
variants are therefore substantially homologous to such a probe and to each other. 

"Transformation/' as defined herein, describes a process by which exogenous 
DNA enters and changes a recipient cell. It may occur under natural or artificial 
conditions using various 1 methods well known in the art. Transformation may rely on 
any known method for the insertion of foreign nucleic acid sequences into a 
prokaryotic or eukaryotic host cell. The method is selected based on the host cell 
being transformed and may include, but is not limited to, viral infection, 
electroporation, lipofection. and particle bombardment. Such "transformed" cells 
include stably transformed cells in which the inserted DNA is capable of replication 
either as an autonomously replicating plasmid or as part of the host chromosome. 
The term "transfection" as used herein refers to the introduction of foreign DNA into 
eukaryotic cells. Transfection may be accomplished by a variety of means known to 
the an including calcium phosphate-DNA co-precipitation ? DEAE-dextran-mediated 
transfection, polybrene-mediated transfection, electroporation, microinjection, liposome 
fusion, lipofection, protoplast fusion, retroviral infection, and biolistics. Thus, the term 
"stable transfection" or "stably transfected" refers to the introduction and integration of 
"T^ignDN ^m<TtTie^nome "of "the traiisfected-cell— -T-he-tenn-stable. transfectanT 
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refers to a cell which has stably integrated foreign DNA into the genomic DNA. The 
term- also encompasses cells which transiently express the inserted DNA or RNA for 
limited periods of time. Thus, the term "transient transfection" or "transiently 
transfected" refers to the introduction of foreign DNA into a cell where the foreign 
DNA fails to integrate into the genome of the transfected cell. The foreign DNA 
persists in the nucleus of the transfected cell for several days. During this time the 
foreign DNA is subject to the regulatory controls that govern the expression of 
endogenous genes in the chromosomes. The term "transient transfectant" refers to' 
cells which have taken up foreign DNA but have failed to integrate this DNA. 

The term "correlates with expression of a polynucleotide," as used herein, 
indicates that the detection of the presence of ribonucleic acid that is similar to a 
particular nucleotide sequence by Northern analysis is indicative of the presence of 
mRNA encoding an F-box protein in a sample and thereby correlates with expression 
of the transcript from the polynucleotide encoding the protein. 

As used herein, the term "polyA r RNA" refers to RNA molecules having a 
stretch of adenine nucleotides at. the 3 J end. This poly.adenine stretch is also referred to 
as a "poly- A tail". EukaryoticmRNA molecules contain poly- A tails and are referred 
to as polyA" RNA. As used herein, the term "cell culture" refers to any in vitro 
culture of cells. Included within this term are continuous cell lines (e.g, with an 
immortal phenotype), primary ceil cultures, finite cell lines {e.g.. non-transformed 
cells), and any other cell population maintained in vitro. In particularly preferred 
embodiments, the cell cultures comprise insect cells. 

As used herein, the term "Baculoviridae" refers to the family of viruses that 
multiply only in invertebrates. Genera of viruses that multiply only in invertebrates are 
also included in other families Iridovirus [Iridoviridae], Entomopoxvirus 

[Poxviridae], Densovirus [Parvoviridae], cytoplasmic polyhedral virus group 
[Reoviridae], and Sigmavirus [Rhabdoviridae]). "Baculovirus" refers to viruses that 
infect insect cells. "Baculovirus-derived vectors" are expression vectors that are 
derived from baculoviruses; these vectors are commonly used to express foreign genes 
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in insect cells. For example, these vectors find use in expression systems for 
recombinant proteins that require eukaryotic processing systems. It is intended that the 
present invention encompass baculovirus-derived vectors, as well as vectors derived 
from other viruses capable of infecting invertebrate cells. In preferred embodiments, 
the vectors are used to infect insect cells. 

As used herein, the term "selectable marker" refers to the use of a gene which 
encodes an enzymatic activity that confers the ability to grow in medium lacking what 
would otherwise be an essential nutrient (e.g., the HIS3 gene in yeast cells); in 
addition, a selectable marker may confer resistance to an antibiotic or drug upon the- 
cell in which the selectable marker is expressed. Selectable markers may be 
"dominant"; a dominant selectable marker encodes an enzymatic activity which can be 
detected in any eukaryotic cell line. Examples of dominant selectable markers include 
the bacterial aminoglycoside 3' phosphotransferase gene (also referred to as the neo 
gene) which confers resistance to the drug G418 in mammalian cells, the bacterial 
hygromycin G phosphotransferase (hyg) gene which confers resistance to the antibiotic 
hygromycin and the bacterial xanthine-guanine phosphoribosyl transferase gene (also 
referred to as the gpt gene) which confers the ability to grow in the presence of 
mycophenolic acid. Other selectable markers are not dominant in that there use must 
be in conjunction with a cell" line that lacks the relevant enzyme activity. Examples of 
non-dominant selectable markers include the thymidine kinase (tk) gene which is used 
in conjunction with tk' cell lines, the CAD gene which is used in conjunction with 
CAD-deficiem cells and the mammalian hypoxanthine-guanine phosphoribosyl 
transferase (hprt) gene which is used in conjunction with hprC cell lines. A review of 
the use of selectable markers in mammalian cell lines is provided in Sambrook. J. et. 
a!., Molecular Cloning: A Laboratory Manual, 2nd ecL Cold Spring Harbor " 
Laboratory. Press, New York (1989) pp. 16.9-16.1 5. 

As used herein, the term "vector" is used in reference to nucleic acid molecules 
that transfer DNA segment(s) from one cell to another. The term "vehicle" is 
sometimes used interchangeably with "vector." 



- 26 - 



WO 99/18989 



PCT/US 98/2 1763 



The term "expression vector" as used herein refers to a recombinant DNA 
molecule containing a desired coding sequence and appropriate nucleic acid sequences ' 
necessary for the expression of the operably linked coding sequence in a particular 
host organism. Nucleic acid sequences necessary for expression in prokaryotes usually 
include a promoter, an operator (optional), and a ribosome binding site, often along 
with other sequences. Eukaryotic cells are known to utilize promoters, enhancers, and 
termination and polyadenylation signals. 

The terms "in operable combination," "in operable order," and "operably 
linked" as used herein refer to the linkage of nucleic acid sequences in such a manner 
that a nucleic acid molecule capable of directing the transcription of a given gene 
and/or the synthesis of a desired protein molecule is produced. The term also refers to 
the. linkage of amino acid sequences in such a manner so that a functional protein is 
produced. 

As used herein, the term "amplifiable nucleic acid" is used in reference to 
nucleic acids which may be amplified by any amplification method. It is contemplated 
that "amplifiable nucleic acid" will usually comprise "_sample template." 

As used herein, the term "sample template" refers to nucleic acid originating 
from a sample which is analyzed for the presence of "target" (defined below). In 
contrast, "background template" is used in reference to nucleic acid other than sample 
template which may or may not be present in a sample. Background template is most 
often inadvertent. It may be the result of carryover, or it may be due to the presence 
of nucleic acid contaminants sought to be purified away from the sample. For 
example, nucleic acids from organisms other than those to be detected may be present 
as background in a test sample. 

As used herein, the term "primer M ~refers to an oligonucleotide, whether- 
occurring naturally as in a purified restriction digest or produced synthetically, which 
is capable of acting as a point of initiation of synthesis when placed under conditions 
in which synthesis of a primer extension product which is complementary to a nucleic 
acid strand is induced, (/.«?., in the presence of nucleotides and an inducing agent such 
~~ ^rDNA~polymerase-and^ 
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single stranded for maximum efficiency in amplification but may alternatively be 
double stranded. If double stranded, the primer is first treated to separate its strands 
before being used to prepare extension products. Preferably, the primer is an 
oligodeoxyribonucleotide. The primer must be sufficiently long to prime the synthesis 
of extension products in the presence of the inducing agent. The exact lengths of the 
primers will depend on many factors, including temperature, source of primer and the 
use of the method. 

As used herein, the term "probe" refers to an oligonucleotide (i.e.. a sequence 
of nucleotides), whether, occurring naturally as in a purified restriction digest or 
produced synthetically, recombinantly or by PCR amplification, which is capable of 
hybridizing to another oligonucleotide of interest. A probe may be single-stranded or 
double-stranded. Probes are useful, in the detection; identification and isolation of 
particular gene sequences. It is contemplated that any probe used in the present 
invention will be labelled with any "reporter molecule," so that is detectable in any 
detection system, including, but not limited to enzyme (e.g. ELISA, as well as 
enzyme-based histochemical assays), fluorescent, radioactive, and luminescent systems. 
It is not intended that the present invention be limited to any particular detection 
system or label. 

As used herein, the term "target" when used in reference to the polymerase 
chain reaction, refers to the region of nucleic acid bounded by the primers used for 
polymerase chain reaction. Thus, the "target" is sought to be sorted oat from other 
nucleic acid sequences. A "segment" is defined as a region of nucleic acid within the 
target sequence. 

As used herein, the term "polymerase chain reaction" ("PCR") refers to the 
method of K.B. Mullis U.S. Patent Nos. 4,683.195, 4,683,202, and 4,965.188, hereby 
incorporated by reference, which describe a method for increasing the concentration of 
a segment of a target sequence in a mixture of genomic DNA without cloning or 
purification. This process for amplifying the target sequence consists of introducing a 
large excess of two oligonucleotide primers to the DNA mixture containing the desired 
TarpF^^ueTxe^folto a 
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DNA polymerase. The two primers are complementary to their, respective strands of - 
the double stranded target sequence. To effect amplification, the mixture is denatured 
and the primers then annealed to their complementary sequences within the target 
molecule. Following annealing, the primers are extended with a polymerase so as to 
form a new pair of complementary strands. The steps of denaturation, primer annealing 
and polymerase extension can be repeated many times (/.e., denaturation, annealing 
and extension constitute one "cycle"; there can be numerous "cycles") to obtain a high 
concentration of an amplified segment of the desired target sequence. The length of the 
amplified segment of the desired target sequence is determined by the relative 
positions of the primers with respect to each other, and therefore, this length is a 
controllable' parameter. By virtue of the repealing aspect of the process, the method is 
referred to as the "polymerase chain reaction" (hereinafter "PCR"). Because the 
desired amplified segments of the target sequence become the predominant sequences 
(in terms of concentration) in the mixture, they are said to be "PCR amplified". 

With PCR, it is possible to amplify a single copy of a specific target sequence 
in genomic DNA to a level detectable by several different methodologies (eg. t 
hybridization with a'labeled probe; incorporation of biotinylated primers followed by 
avidin-enzyme conjugate detection; incorporation of 32 P-labeled deoxynucleotide 
triphosphates, such as dCTP or dATP, into the amplified segment). In addition to 
genomic DNA, any oligonucleotide sequence can be amplified with the appropriate set 
of primer molecules. In particular the amplified segments created by the PCR process 
itself are, themselves, efficient templates for subsequent PCR amplifications. 

"Amplification" is a special case of nucleic acid replication involving template 
specificity. It is to be contrasted with non-specific template replication {i.e.. 
replication that is template-dependent but not dependent on a specific template). 
Template -specificity is here distinguished from fidelity of replication (i.e., synthesis of. 
the proper polynucleotide sequence) and nucleotide (ribo- or deoxyribo-) specificity. 
Template specificity is frequently described in terms of "target" specificity. Target 
sequences are. "targets" in the sense that they are sought to be sorted out from other 
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nucleic acid. Amplification techniques have been designed primarily for this sorting- 
out. 

Template specificity is achieved in most amplification techniques by the choice 
of enzyme. Amplification enzymes are enzymes that, under conditions they are used, • 
will process only specific sequences of nucleic acid in a heterogeneous mixture of 
nucleic acid. For example, in the case of QP replicase, MDV-1 RNA is the specific 
template for the replicase (D.L. Kacian ei ai, Proc. Natl. Acad. Sci. USA 69:3038 
[1972]). Other nucleic acid will not be replicated by this amplification enzyme. 
Similarly in the case of T7 RNA polymerase, this amplification enzyme has a stringent 
specificity for its own promoters (M. Chamberlin et a!., Nature 228:227 [1970]). In 
the case of T4 DNA ligase, the enzyme will not ligate the two oligonucleotides where 
there is a mismatch between the oligonucleotide substrate and the template at the 
ligation junction (D.Y. Wu and R B. Wallace. Genomics 4:560 [1989]). Finally, Taq 
and Pfu polymerases by virtue of their ability to function at high temperature, are 
found to display high specificity for the sequences bounded and thus defined by the 
primers; the high temperature results in thermodynamic conditions that favor primer 
hybridization with the target sequences and not hybridization with non-target sequences 
(H.A. Erlich (ed.), PCR Technology, Stockton Press [1989]). 

As used herein, the terms "PCR product," "PCR fragment," and "amplification 
product" refer to the resultant mixture of compounds after two or more cycles of the 
PCR steps of denaturation, annealing and extension are complete. These terms 
encompass the case where there . has been amplification of one or more segments of 
one or more target sequences. 

As used herein, the term "amplification reagents" refers to those reagents 
Tdeqxyribonucfeotide triphosphates," buffer, etc:), "needed for ampl ification except for - - 
primers, nucleic acid template and the amplification enzyme. Typically, amplification 
reagents along with other reaction components are placed and contained in a reaction 
vessel (test tube, microwell, etc.). . 
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As used herein, the terms "restriction endonucleases" and "restriction enzymes" 
refer to bacterial enzymes, each of which cut double-stranded DNA at or near a 
specific nucleotide sequence. 

As used herein, the term "recombinant DNA molecule" as used herein refers to 
5 a DNA molecule which is comprised of segments of DNA joined together by means of 

molecular biological techniques. 

DNA molecules are said to have "5 1 ends" and "3* ends" because 
mononucleotides are reacted to make oligonucleotides in a. manner such that the 5' 
phosphate of one mononucleotide pentose ring is attached to the 3' oxygen of its 

10 neighbor in' one direction via a phosphodiester linkage. Therefore, an end of an 

oligonucleotides referred to as the "5' end" if its 5' phosphate is not linked to the 3* 
oxygen of a mononucleotide pentose ring and as the "3' end" if its 3' oxygen is not 
linked to a 5* phosphate of a subsequent mononucleotide pentose ring. As used 
herein, a nucleic acid sequence, even if internal to a larger oligonucleotide, also may 

15 be said to have 5' and 3' ends. In either a linear or circular DNA molecule, discrete 

elements are referred to as being "upstream" or 5' of the "downstream" or 3 ? elements. 
This terminology reflects the fact that "transcription proceeds in a 5 1 to 3' fashion along 
uhe DNA strand. The promoter and enhancer elements which direct transcription of a 
linked gene are generally located 5' or upstream of the coding region. However, 

20 enhancer elements can exert their effect even when located 3 r of the promoter element 

and the coding region. Transcription termination and polyadenylation signals are 
located 3' or downstream of the coding region. 

As used herein, the term "an oligonucleotide having a nucleotide sequence 
encoding a gene" means a nucleic acid sequence comprising the coding region of a 

25- gene or in other words the nucleic acid sequence which encodes a gene product. -The^ 

coding region may be present in either a cDNA, genomic DNA or RNA form. When 
present in a DNA form, the oligonucleotide may be single-stranded (/.<?., the sense - - 
strand) or double-stranded. Suitable control elements such as enhancers/promoters, 
splice junctions, polyadenylation signals, etc. may be placed in close proximity to the 

30 coding region J>fjhgL g^_i^needed_to permit pro per initiat ion of transcri ption, and/or 
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correct processing of the primary RNA transcript. Alternatively, the coding region 
utilized in the expression vectors of the present invention may contain endogenous 
enhancers/promoters, splice junctions, intervening sequences, polyadenylation signals, 
etc. or a combination of both endogenous and exogenous control elements. 
5 As used herein, the term "regulatory element" refers to a genetic element which 

controls some aspect of the expression of nucleic acid sequences. For example, a 
promoter is a regulatory element which facilitates the initiation of transcription of an 
operably linked coding region. Other regulatory elements are splicing signals, 
polyadenylation signals, termination signals, etc. (defined infra). 

10 Transcriptional control signals in eukaryotes comprise "promoter" and 

"enhancer" elements. Promoters and enhancers consist of short arrays of DNA 
sequences that interact specifically with cellular proteins involved in transcription (T. 
Maniatis er al. Science 236:1237 [1987]). Prompter and enhancer elements have been 
isolated from a variety of eukaryotic sources including genes in yeast, insect and 

15 mammalian cells and viruses (analogous control elements, i.e.. promoters, are also 

found in prokaryotes). The selection of a particular promoter and enhancer depends on 
what cell type is to be used to express the protein of interest. Some eukaryotic 
promoters and enhancers have a broad host range while others are functional in a 
limited subset of cell types (for review see, S.D. Voss et al, Trends Biochem. Sci., 

20 1 1:287 [1986]; and T. Maniatis et at ? supra). For example, the SV40 early gene 

enhancer is very active in a wide variety of cell types from many mammalian species, 
and has been widely used for the expression of proteins in mammalian cells (R. 
Dijkema et a/.vEMBO J. f .. 4:761 [1985]). Two other examples of promoter/enhancer 
elements active in a broad range .of ..mammalian cell types are those from the human 

25 elongation factor la gene (T. Ueisuki et aU J. Biol. Chem. ? 264:5791 [1989]; D.W. 

Kim et al. Gene 91:217 [1990]; and S. Mizushima and S. Nagata, Nuc. Acids Res... 
18:5322 [1990]), and the long terminal repeats of the Rous sarcoma virus (CM. 
Gorman et al., Proc. Natl. Acad. Sci. USA 79:6777 [1.982]), and the human 

cytomegalovirus ( M Boshart et a/., C ell 41: 521 [1985 ]). 
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As used herein, the term "promoter/enhancer" denotes a segment of DNA 
which contains sequences capable of providing both promoter and enhancer functions 
(i.e.. the functions provided by a promoter element and an enhancer element, as 
discussed above). For example, the long terminal repeats of retroviruses contain both 
promoter and enhancer functions. The enhancer/promoter may be "endogenous," 
"exogenous," or "heterologous." An "endogenous" enhancer/promoter is one which is 
naturally linked with a given gene in the genome. An "exogenous" or "heterologous" 
enhancer/promoter is one which is placed in juxtaposition to a gene by means of 
genetic manipulation (ie. , molecular biological techniques), such that transcription of 
that gene is directed by the linked enhancer/promoter. 

The presence of "splicing signals" on an expression vector often results in 
higher levels of expression of the recombinant transcript. Splicing signals mediate the 
removal of introns from the primary RNA transcript and consist of a splice donor and 
acceptor site (See e.g., J. Sambrook et at. Molecular Cloning: A Laboraioiy Manual, 
2nd ed.. Cold Spring Harbor Laboratory Press, New York [1989], pp. 16.7-16.8). A 
commonly used splice donor and acceptor site is the splice junction from the 16S RNA 
of SV40. 

Efficient expression of recombinant DNA sequences in eukaryotic cells requires 
expression of signals directing the efficient termination and polyadenylation of the 
resulting transcript. Transcription termination signals are generally found downstream 
of the polyadenylation signal and are a few hundred nucleotides in length. The term 
"polv A site" or "poly A sequence," as -used herein, denotes a DNA sequence that 
directs both the termination and polyadenylation of the nascent RNA transcript. 
Efficient polyadenylation of the recombinant transcript is desirable, as transcripts 
lacking a poly A tail are unstable and are rapidly degraded. The poly A signal utilized 
in an expression vector may be "heterologous" or "endogenous." An endogenous poly 
A signal is one that is found naturally at the 3" end of the coding region of a given 
gene in the genome. An heterologous poly A signal is one which is isolated from one 
gene and placed 3 1 to another gene. A commonly used heterologous poly A signal is 
the SV40 poly A signal. The SV40 poly A signal is contained on a 237 bp 
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BamHl/Bcfl restriction fragment, and directs both termination and polyadenylation (S. * 
Sambrook, supra, at 16.6-16.7). 

Eukaryotic expression vectors may also contain "viral replicons," or "viral 
origins of replication." Viral replicons are viral DNA sequences which allow for the 
5 extrachromosomal replication of a vector in a host cell expressing the appropriate 

replication factors. 

As used herein, the terms "nucleic acid molecule encoding," "DNA sequence 
encoding," and "DNA encoding" refer to the order or sequence of 
deoxyribonucleotides along a strand of deoxyribonucleic acid. The order of these 

10 deoxyribonucleotides determines the order of amino acids along the polypeptide 

(protein) chain. The DNA sequence thus codes for the amino acid sequence. 

The term "Southern blot" refers to the analysis of DNA on agarose or 
acrylamide gels to fractionate. the DNA according to size followed by transfer of the 
DNA from the gel to a solid support, such as nitrocellulose or a nylon membrane. 

15 The immobilized DNA is then probed with a labeled probe to detect DNA species 

complementary to the probe used. The DNA "may be cleaved with restriction enzymes 
prior to electrophoresis. Following electrophoresis, the DNA may be partially 
depurinated and denatured prior to or during transfer to the solid support. Southern 
blots are a standard tool of molecular biologists (See e.g., J. Sambrook et ai supra at 

20 pp 9.31-9-58). 

The term "Northern blot" as used herein refers to the analysis of RNA by 
electrophoresis of RNA on agarose gels to fractionate the RNA according to size 
followed by transfer of the- RNA from the gel to a solid support, such as nitrocellulose 
or a nylon membrane. The immobilized RNA is then probed with a labeled probe to 

25 detect RNA species complementary to the probe used. Northern blots are a standard 

tool of molecular biologists (See e.g., Sambrook ei ai, supra at pp. 7.39-7.52). 

The term "isolated" when used in relation to a nucleic acid, as in "an isolated 
oligonucleotide 0 refers to a nucleic acid sequence that is identified and separated from 
at least one contaminant nucleic acid with which it is ordinarily associated in its 

30 natural source. Isolated nucleic acid is such present in a form or setting that is 
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different from that in which it is found in nature (e.g., in an expression vector). In 
contrast, non-isolated nucleic acids are nucleic acids such as DNA and RNA found in 
the state they exist in nature. For example, a given DNA sequence (e.g., a gene) is 
found on the host cell chromosome in proximity to neighboring genes; RNA 
sequences, such as a specific mRNA sequence encoding a specific protein, are found in 
the cell as a mixture with numerous other mRNAs which encode a multitude of 
proteins. However, isolated nucleic acid encoding a mammalian F-box protein 
includes, by way of example, such nucleic acid in cells ordinarily- expressing an F-box 
protein where the nucleic acid is in a chromosomal location different from that of 
natural cells, or is otherwise flanked by a different nucleic acid sequence than that 
found in nature. The isolated nucleic acid or oligonucleotide may be present in single- 
stranded or double-stranded form. When an isolated nucleic acid or oligonucleotide is 
to be utilized to express a protein, the oligonucleotide will contain at a minimum the 
sense or coding strand (i.e., the oligonucleotide may single-stranded), but may contain 
both the sense and anti-sense strands (i.e., the oligonucleotide may be double- 
stranded). 

As used herein, the term "immunogen" refers to a substance, compound, 
molecule, or other moiety which stimulates the production of an immune response. 
The term "antigen" refers to a substance, compound, molecule, or other moiety that is 
capable of reacting with products of the immune response. For example, F-bbx 
proteins may be used as immunogens to elicit an immune response in an animal to 
produce antibodies directed against the subunit used as an immunogen. The subunit 
may then be used as an antigen in an assay to detect the presence of anti-F-box protein 
-antibodies in the serum of the immunized animal. It is not intended that the present 
invention be limited to antigens or immunogens consisting solely of one protein (/.<?., it 
is intended that the present invention encompass complexes). Nor is it intended that 
the present invention be limited to any particular antigens or immunogens. 

The term "antigenic determinant," as used herein, refers to that portion of a 
molecule ( i.e., an antigen) that makes contact with a particular antibody (i.e., an 
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epitope).. When a protein or fragment of a protein is used to immunize a host animal * 
(e.g., an "immunocompetent" animal with "immunocompetent cells"), numerous 
regions of the protein may induce the production of antibodies which bind specifically 
to a given region or three-dimensional structure on the protein; these regions or 
structures are referred to as antigenic determinants. An antigenic determinant may 
compete with the intact antigen (i.e., the immunogen used to elicit the immune 
response) for binding to an antibody. 

The terms "specific binding" or "specifically binding," as used herein, in 
reference to the interaction of an antibody and a protein or peptide, mean that the 
interaction is dependent upon the presence of a particular structure (i.e.. the antigenic 
determinant or epitope) on the protein; in other words, the antibody is recognizing and 
binding to a specific protein structure rather than to proteins in general. For example, 
if an antibody is specific for epitope "A", the presence of a protein containing epitope 
A (or free, unlabeled A) in a reaction containing labeled "A" and the antibody will 
reduce the amount of labeled A bound to the antibody. 

As used herein, the term "antibody" (or "immunoglobulin"), refers to intact 
molecules as well as fragments thereof, such as Fa, F(ab') 2 , and F\\ which are capable 
of binding the epitopic determinant. Antibodies that bind F-box polypeptides can be ■ 
prepared using intact polypeptides or fragments containing small peptides of interest as 
the immunizing antigen. The polypeptide or peptide used to immunize an animal can - 
be derived from the transition. of RNA or synthesized chemically, and can be 
conjugated to a carrier protein; if desired. Commonly used carriers that are chemically 
coupled to peptides include bovine serum albumin and thyroglobulin. The coupled 
peptide is then used to immunize the animal (e.g., a mouse, a rat. or a rabbit). 

The present invention encompasses polyclonal, as well as monoclonal 
antibodies. The antibodies used in the methods invention may be prepared using 
various immunogens. In one embodiment, the immunogen is a human F-box protein 
or subunit (e.g... any of the amino acid sequences set forth in Tables 2 and 4 used as 
an immunogen) to generate antibodies that recognize human F-box proteins. Such 
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antibodies include, but are not limited to polyclonal, monoclonal, chimeric, single 
chain, Fab fragments, and an Fab expression library. 

Various procedures known in the art may be used for the production of 
polyclonal antibodies to F-box proteins and subunits. For the production of antibody, 
various host animals can be immunized by injection with the peptide corresponding to 
the F-box protein epitope of interest, including but not limited to rabbits, mice, rats, 
sheep, goats, etc. In a preferred embodiment, the peptide is conjugated to an 
immunogenic carrier {e.g., diphtheria toxoid, bovine serum albumin (BSA), or keyhole 
limpet hemocyanin [KLH]). Various adjuvants may be used to increase the 
immunological response, depending on the host species, including but not limited to 
Freund's (complete and incomplete), mineral gels such as aluminum hydroxide, surface 
active substances such as lysolecithin, pluronic polyols, polyanions, peptides : oil 
emulsions, keyhole limpet hemocyanins, dinitrophenol. and potentially useful human 
adjuvants such as BCG (Bacille Calmette-Guerin) and Corynebacterium parvum. 

For preparation of monoclonal antibodies directed toward F-box proteins, any 
technique that provides for the production of antibody molecules by continuous ceil 
lines in culture may be used (See e.g.'. Harlow and Lane, Antibodies: A Laboratory 
Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY). These 
include but are not limited to the hybridoma technique originally developed by Kohler 
and Milstein (Kohler and Milstein, Nature 256:495-497 [1975]), as well as the trioma 
technique, the human B-cell hybridoma technique {See e.g., Kozbor et al. Immunol. 
Today 4:72 [1983]). and the EBV-hybridoma technique to produce human monoclonal 
antibodies (Cole et ai, in Monoclonal Antibodies and Cancer Therapy: Alan R. Liss, 
Inc., pp. 77-96 [1985]). 

In an additional embodiment of the invention, monoclonal antibodies can be 
produced in germ-free animals utilizing recent technology {See e.g., 
PCT/US90/02545). According to the invention, human antibodies may be used and 
can be obtained by using human hybridomas (Cote et ai, Proc. Nath. Acad. Sci. 
U.S.A.80:2026-2030 [1983]) or by transforming human B cells with-EBV virus in 
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vitro (Cole et ai, in Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, pp. 
77-96 [1985]). 

According to the inveniion, techniques described for the production of single 
chain antibodies (U.S. Patent 4,946,778; herein incorporated by reference) can be 
adapted to produce F-box protein-specific single chain antibodies. An additional 
embodiment of the invention utilizes the techniques described for the construction of 
Fab expression libraries (Huse et ai, Science 246:1275-1281 [1989]) to allow rapid 
and easy identification of monoclonal Fab fragments with the desired specificity for 
F-box proteins 

Antibody fragments which contain the idiotype (antigen binding region) of the 
antibody molecule can be generated by known techniques. For example, such 
fragments include but are not limited to: the F(ab')2 fragment which can be produced 
by pepsin digestion of the antibody molecule: the Fab' fragments which can be 
generated by reducing the disulfide bridges of the F(ab')2 fragment, and the Fab 
fragments which can be generated by treating the antibody molecule with papain and a 
reducing agent. _ ... " 

In the production of antibodies, screening for the desired antibody can 
be accomplished by techniques known in the art (e.g., radioimmunoassay, 
ELISA [enzyme-linked immunosorbant assay], "sandwich" immunoassays, 
immunoradiometric assays, gel diffusion precipitin reactions, immunodiffusion 
assays, in situ immunoassays [using colloidal gold, enzyme or radioisotope 
labels, for example], Western Blots, precipitation reactions, agglutination assays {e.g., 
gel agglutination assays, hemagglutination assays, etc.), complement fixation assays, 
immunofluorescence assays, protein A assays, and Immunoelectrophoresis assays, etc. 

As used herein the term "immunogenically-effective amount" refers to that 
amount of an ftnmunogen required to invoke the production of protective levels of 
antibodies in a host upon vaccination. 

As used herein, the term "reporter reagent" or "reporter molecule" is used in 
reference to compounds which are capable of detecting the presence of antibody bound 
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to antigen. For example, a reporter reagent may be a colorimetric substance which is 
attached to an enzymatic substrate. Upon binding of antibody and antigen, the. enzyme 
acts on its substrate and causes the production of a color. Other reporter reagents 
include, but are not limited to fluorogenic and radioactive compounds or molecules. 
5 As used herein the term "signal" is used in reference to the production of a sign 

that a reaction has occurred, for example, binding of antibody to antigen. It is 
contemplated that signals in the form of radioactivity, fluorogenic reactions, and 
enzymatic reactions will be used with the present invention. The signal may be 
assessed quantitatively as well as qualitatively. 

10 As used herein the term "NF-kB regulatory factors" refers to any factors (e.g., 

proteins, enzymes, peptides, small molecules, and nucleic acids) involved in the 
regulation of NF-kB signalling pathways. For example, such factors include, but are 
not limited to, F-box proteins, IkBs. IKKs, and agonists, antagonists, and cofactors that 
interact with these factors. It is contemplated that the NF-kB regulatory factors can 

15 either directly or indirectly {e.g., through other factors) bind to a target of interest 

(e.g., a slimb protein). 

GENERAL DESCRIPTION OF THE INVENTION 

The present invention provides compositions and methods for gene 
identification, as well as drug discovery and assessment. The present invention 

20 provides components of an E3 complex involved in ubiquitination of cell cycle 

regulators and other proteins, as well as members of a class of proteins that directly 
function in recognition of ubiquitination targets. These compositions are involved in 
protein degradation pathways associated with the eukaryotic cell cycle, among others. 
Protein degradation is a commonly employed mechanism for the control of 

25 protein abundance. It is also a particularly effective method for promoting 

unidirectional cell cycle transitions because of its rapidity and irreversibility. Three 
major transitions {i.e.. entry into S phase, separation of sister chromatids, and exit 

from jni_tosis) ? req uire the d egradation o f specific proteins via the ubiquitin-26S 
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proteosome pathway (reviewed in King et al, Science 274:1652-1659 [1996]). 
Ubiquitin is a relatively, small protein (approximately 76 amino acid residues) found in 
all cells of higher organisms. Ubiquitin plays major roles in intracellular protein 
degradation and histone modification. 

Thus, ubiqiiitination is an important mechanism used to regulate protein 
abundance. However, until the development of the present invention, the specificity of 
target selection for ubiquitin dependent proteolysis was largely unknown. Central to 
this process are the E3s which confer substrate specificity on the ubiquitination 
reaction and are therefore likely points for regulation. The present invention provides 
methods for producing ubiquitinated Sicl in vivo and in vitro using recombinant 
proteins. The present invention also provides compositions and methods for the 
development of drugs and other compounds effective in correcting abnormalities in 
protein degradation, based on the demonstration that 1) Cdc53 ? Skpl. and Cdc4 form a 
functional E3 ubiquitin ligase complex that works together with the E2 Cdc34 to 
ubiquitinate Sicl; 2) Cdc4 acts as a receptor for phosphorylated Sicl recognition; and 
3) the sole function Cln/Cdc28 kinases in this process is "to phpsphorylate Sicl, 
allowing recognition by Cdc4. Importantly, it was found that distinct F-box proteins 
can selectively recognize different ubiquitination substrates in a phosphorylation- 
dependent manner. 

The formation of ubiquitin-protein conjugates in protein degradation pathways 
involves three components that participate in a cascade of ubiquitin transfer reactions: 
a ubiquitin activating enzyme (El), a ubiquitin conjugating enzyme (E2).- and a 
specificity factor (E3) (Hershko et aL T J. Biol. Chem.. 267:8807-8812 [1983]). 
Ubiquitin is activated as a thiol -ester on El in an ATP-dependent reaction, transferred 
to an E2 as a thiol ester and ultimately conjugated to the target protein in conjunction 
with an E3 ? which functions in substrate recognition and in some instances may serve 
as a thiol-ubiquitin carrier (Scheffner et aL, Cell 75:495-505 [1993]; and Scheffner et 
aL, Nature 373:81-83 [1995]). Together, these enzymes polyubiquitinate lysine 
_resid.ues_in_targei pro teins t hroug h formatio n of iso peptide bonds w i th ubiquitin, - 
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leading to recognition by the 26S proteosome. This association eventually results in * 
the degradation of the target protein. 

While El and E2 proteins can be identified through sequence similarity, this is 
not yet generally true for E3 proteins. Thus, the present invention provides previously 

5 unreported methods and compositions. This is significant as the identity of E3 

components are a central issue in cell cycle control, among other processes, because 
they are potential regulators of both the timing of ubiquitination and the selection of 
substrates. Prior to the development of the present invention, much of the prior 
knowledge of E3s was provided by analysis of the HECT domain protein E6-AP 

10 which functions as a ubiquitin-ligase for p53, (Huibregtse et a/., Proc. Natl. Acad. Sci. 

USA 92:2563-2567 [1995]; and Scheffner et ai, Nature 373:81-83. [1995]); and the 
. anaphase promoting complex (APC). which functions in the destruction of mitotic 
cyclins and proteins involved. in sister chromatid cohesion (reviewed in King et ai, 
[1996] supra). These APC substrates contain a destruction box motif, although 

15 precisely how the timing and selection of substrates by the APC is achieved is 

unknown. In contrast, timing .of ubiquitination of a variety of non-APC substrates is 
thought to be regulated in part by the phosphorylation of the substrate itself. PEST 
sequences (i.e.. sequences that are rich in proline, glutamic acid, serine and threonine) 
are frequently found in unstable proteins such as cyclins and contain sites of 

20 phosphorylation (Rogers et ai. Science 234:364-368 [1986]). Phosphorylation of 

specific residues has been implicated in the destruction of Gl cyclins in yeast and 
mammalian cells (Tyers et «/., EMBO J., 11:1773-84 [1992]: Lanker et ai. Science - 
271:1597-1601 [1996]; Clurman et ai. Genes Dev., 10:1979-1990 [1996]: Diehl et ai, 

Genes Dev., 1 1 :957-972 _[ 1 997]; and Won and Reed. EMBO J.. 15:4182-4193 [1996]), 

25 and the cyclin-kinase inhibitor (CKI) p27 (Sheaff et ai. Genes Dev.. 1 1:1464-1478 

[1997]). 

In S. cerevisiae, entry into S-phase requires activation of the Cdc28 kinase by 
Gi cyclins (Clnl, Cln2, and Cln3) and S-phase cyclins (Clb5 and Clb6) (Nasmyth, 
Curr. Opin. Cell Biol., 5:166-179 [1993]). Although both Cln/Cdc28 and Clb/Cdc28 
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complexes assemble during Gl, Clb/Cdc28 is sequestered in an inactive form through- 
association with the CK1 p40Sic! (MendenhalL Science 259:216-219 [1993]; and 
Schwob et. aL, Cell 79: 233-244 [1994]), Sicl levels vary in the cell cycle, sharply 
decreasing at the Gl/S transition, and this correlates with activation of Clb5/Cdc28. 
5 The decrease in Sicl levels depends on the E2 Cdc34 s suggesting that ubiquitination 

triggers Sicl destruction (Schwob et aL, Cell 79:233-244 [1994]). Sicl destruction 
also requires CLN and CDC2S function; elimination of Sicl defines the threshold 
requirement for Cln/Cdc28 activity in S-phase entry (Schneider et aL, Science 272: 
' 560-562 [1996]; Schwob et'aL, [1994], supra; and Tyers, Proc. Natl. Acad. Sci. 

10 U.S.A. 93:7772-7776 [1996]). Although Sicl is a phosphoprotein (Schneider et aL. 

Science 272:560-562 [1996]), iris not known whether Cln/Cdc28 complexes directly 
phosphorylate Sicl or whether phosphorylation plays another, perhaps indirect, role in 
Sicl destruction. The development of the present invention provides methods. and 
compositions to resolve these questions. 

15 Three other genes, SKPI, CDC53, and CDC4, are also required for S-phase 

entry (Schwob et aL., J 994, supra; and Bai et aL, Cell "86:263-274 [1996]). These 
genes, together with CDC34, show a pattern of suppression and enhancement 
consistent with roles in a common process; conditional alleles of these genes cause 
arrest with unreplicated DNA and multiple buds (Yochem and Byers, J. Mol. BioL, 

20 195:233-245 [1987]; Goebl et aL, L Mol. Biol., 195:233-245 [1988]: Bai et aL, [1996] 

supra; and Mathias et aL, Mol. Cell. BioL. 16:6634-6643 (1996]). Sicl accumulates 
in cdc34-l. cdc4-l. or skp I -11 mutants, and deletion of SIC] allows such mutants to 
undergo DNA synthesis (Schwob et aL, [1996], supra; and Bai et aL. [1996], supra). 
Components of the Cdc34 pathway have also been implicated in the destruction of a 

25 number of other important regulatory proteins, including Cln2 (Deshaies et aL, EMBO 

J., 14:303-312 [1995]; Bai et aL. [1996] supra; and Willems et aL. Cell 86:453-463 
[1996]), Cdc6 (Piatti et aL, Genes Dev. ? 10:1516-1531 [1996]), the CKIs Ruml and 
Farl (McKinney et aL, Genes Dev., 7:833-843 [1993]; and Kominami and Toda, 
Genes Dev., 11:1548-1560 [1997]), and the transcription factor Gcn4 (Kornitzer et aL, 
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EMBO J., 33:6021-6030 [1994]). Thus, it is contemplated that Cdc34, Cdc53, Skpl, - 
and Cdc4 are utilized for the destruction of diverse regulatory proteins. A requirement 
for Cdc34 for Cln2 ubiquitination has been demonstrated in crude yeast lysates 
(Deshaies et ai, [1995], supra), but this requirement has been suggested to be indirect 
5 (Blondel and Mann, Nature 384:279-282 [1996]). Interestingly, SKP1 is also required 

for the G2/M transition (Bai et ai t [1996], supra; and Connelly and Heiter, Cell 
86:275-285 [1996]), and has been found to be a component of the kinetochore 
complex CBF3 (Connelly and Fleiter, Cell 86:275-285 [1996], supra\ and Stemmann 
and Lechner, EMBO J., 15:3611-3620 [1996]). 

10 Skpl binds to Cdc4, and this interaction involves a motif in Cdc4 referred to as 

the F-box (Bai et ai, [1996], supra). The F-box motif is found in a large number of 
proteins including cyciin F (Bai et at. EMBO J.. 15:361 1-3620 [1994]) and the cyclin 
A/Cdk2 -associated protein Skp2 (Zhang et ai. Cell 82:915-925 [1995]), both of which 
bind Skpl. The two largest classes of F-box proteins either contain WD-40 repeats 

15 (e.g., Cdc4) or leucine-rich repeats (LRR) (e.g., Skp2 and Grrl) (Bai et al. t [1996], 

supra). GRR1 was initially identified as a gene required'for glucose repression (Flick 
and Johnston, Mol. Cell. Biol., 11:5101-12 [1991]) but was later also found to be 
involved in Cln destruction (Barral et ai., Genes Dev., 9:399-409 [1995]). The ' 
discovery that Skpl is required for the destruction of both Sicl and Cln2, while Cdc4 

20 and Grrl were only implicated in the destruction of one of these, led to development 

of one embodiment of the present invention, (i.e., one model) in which F-box proteins 
function to recognize targets for ubiquitination. and Skpl links these F-box/target 
complexes to the ubiquitination machinery. 

The present invention was developed in a stepwise fashion, with an important . 

25 aspect being the elucidation of the role of Skpl and F-box proteins in ubiquitination 

through in vitro reconstruction of the Sicl ubiquitination pathway. . Sicl .ubiquitination 
was found to depend upon each of the proteins implicated in Sicl destruction in vivo. 
For example, Skpl recruits Cdc4 into a Cdc53/Cdc34 complex, and enhances 
recognition of Sicl by Cdc4, with the latter interaction requiring Sicl phosphorylation. 
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In contrast, Grrl does not interact with Sicl, but does recruit phosphorylated Clnl and' 
Cln2 into Skpl/Cdc53 complexes. Thus, the present invention provides F-box proteins 
that function as receptors, which recruit substrates into a Skpl/Cdc53/Cdc34 complex 
for ubiquitination by Cdc34. 

Thus, the present invention provides the function of a class of proteins referred 
to as F-box proteins in targeted ubiquitination. The present invention finds utility in 
methods for developing compounds that affect ubiquitination. The present invention 
also provides numerous novel F-box containing mammalian genes whose encoded 
proteins are contemplated to function in processes including, but not limited, to 
targeted ubiquitination of cellular proteins. Specifically, F-box proteins function as 
receptors for proteins to be ubiquitinated. 

As described in the Examples, through a series of experiments using a set of 
defined proteins found in S. cerevisiae. it was demonstrated that three proteins (/.<?.. 
Cdc53, SkpL and the F-box protein Cdc4) form a complex referred to as an "E3" 
which functions together with an El ubiquitin activating enzyme, and the E2 ubiquitin 
conjugating enzyme Cdc34, to. ubiquitinate the Cdk inhibitor Sicl. Recognition of 
Sicl by this E3 complex requires that Sicl be specifically phosphorylated and 
phosphorylation may be a general mechanism used to regulate the timing of 
ubiquitination of target proteins. Thus, it is contemplated that compounds that alter 
this phosphorylation will in turn, alter the timing of ubiquitination of target proteins. 
Such compounds are contemplated as possible drugs that disrupt at least one pathway 
in which F-box proteins function, and are required for biological and/or biochemical 
processes. 

Cde53 was found to function as an adapter and link Skpl to the E2 f while 
Skpl was found also to function as an adapter and links Cdc53 to the F-box protein . 
Cdc4. Cdc4 was found to function as an adapter to link ubiquitination targets {e.g., 
Sicl) to the Skpl/Cdc53/Cdc34 complex. El is not a.stable component of the 
ubiquitination complex, but is required for ubiquitination of the target protein. The 
_E--box-pr.otein_CQnta^^^ domains. The F-box 

is a sequence of 35-45 amino acids and allows the F-box proteins to enter into 
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complexes with Skpl. F-box proteins also contain additional domains, typically, but - 
not necessarily C-terminal to the F-box sequence, which based on the results with 
Cdc4 function as recognition components for ubiquitination substrates. Cdc4 contains 
'C-terminal WD-40 repeats. Another F-box protein (Grrl) contains leucine rich repeats 
5 which are protein-protein interaction domains. Because Skpl simultaneously forms 

complexes with Cdc53 proteins and an F-box protein, these interactions give rise to 
formation of an E3 complex. Any particular F-box protein may interact 
simultaneously with both Skpl and at least one ubiquitination target. F-box proteins 
may have a single ubiquitination target but it is contemplated that they (i.e., at least 

10 some F-box proteins) also have multiple in vivo ubiquitination targets. For example, 

the data obtained for Cdc4 indicate that it is involved in the destruction of at least two 
proteins, Sicl and Cdc6. Thus, the present invention provides the necessary 
components and methods to alter ubiquitination of target proteins through the use of 
new drugs or other compounds. 

15 Based on the sequence of the yeast genome, it was determined that 5. 

cerevisiae contains nine F-box proteins- CDC4 is required for the destruction of Sicl 
and Cdc6, while Grrl is required for the destruction of the Gl (Cln) cyclins, and 
MET30 is required for proper control of methionine biosynthetic pathways and is 
predicted to control the abundance of Met4. 

20 The present invention also provides methods and compositions useful to 

determine the complexity and diversity of mammalian F-box proteins, as well as the 
identity of F-box proteins from various species, the protein-protein interaction domains 
involved, the proteolytic pathways, and regulatory pathways. For example, the 
mammalian proteins (cyclin. F. Skp2) contain an F-box and associate with Skpl, but 

25 their functions and ubiquitination targets have not been demonstrated. Cyclin F 

contains a cyclin box motif C-terminal to the F-box. Skp2 contains a leucine rich 
motif C-terminal to its F-box. Mouse MD6, an additional mammalian F-box 
containing protein; X54352) is in Genbank but its function is unknown. The present 

Lnyejujon .provides hu man MD6, with the following sequences: 
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LPLELSFYLLKWLDPQTLLTCCLVSKQWNKVISACTEVW (SEQ ID NO:57); and ' 
CTTCCCCTGGAGCTCAGTTTTTATTTGTTAAAATGGCTCGATCCTCAGACTTTA 
CTCACATGCTGCCTCGTCTCTAAACAGTGGAATAAGGTGATAAGTGCCTGTAC 
AGAGGTGTGG (SEQ ID NO:58; AA252600). 

Furthermore, the closest homolog of MD6 is MET30; it is contemplated that 
MD6 plays a homologous role in methionine, biosynthesis in eukaryotes. The present 
invention provides methods and compositions to identify the functions and 
ubiquitination targets of these and other F-box containing proteins. 

The present invention also provides amino acid and DNA sequence information 
for eighteen novel F-box-containing human or mouse genes. As with Cdc4, Grrl, 
Skp2, and cyclin F, these novel F-box proteins have the capacity to associate with 
Skpl and to simultaneously interact with other proteins through other protein-protein 
interaction motifs encoded by regions of their genes other than the F-box. Thus, the 
present invention provides compositions and methods for determining the interaction of 
these proteins with other proteins. 

Mammalian Skpl, by analogy with budding yeast, functions. as an adapter 
linking Skpl to an E2. . It is contemplated that cellular proteins brought jnto 
complexes containing Cdc53 and Skpl by any one of these novel F-box proteins has 
the potential to be ubiquitinated by an E2 (e.g., Cdc34) in combination with an El. It 
is further contemplated that interaction with an F-box protein may also produce an 
alternative regulatory function (e.g., altering subcellular localization of the associated 
protein). Thus, the function of F-box proteins is not necessarily limited to 
ubiquitination, and the present invention provides the methods and compositions to 
make.this determination. It is contemplated that additional F-box containing genes '_. 
will be discovered through the use of two-hybrid screens with Skpl or ubiquitination 
targets as the two-hybrid "bait" (e.g., as described in the Example 6). It is also 
contemplated that additional F-box genes will be discovered through sequencing of the 
mammalian genome and sequence analysis, to determine the homology with existing 
F-box proteins, such as those identified in th e present invention. 
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For example, it is contemplated that cell cycle regulators such as cyclins and - 
cyclin-kinase inhibitors, transcription regulators, proteins involved in DNA replication, 
and other cellular regulatory proteins will be identified and/or investigated using the 
methods and compositions provided by the present invention. It is further 
contemplated that the present invention will find use in elucidating inflammatory 
response and infectious disease processes involving protein degradation, as well as 
development of compounds that control (i.e., either enhance or retard) protein 
degradation as appropriate, to ameliorate the effects of the inflammatory response or 
disease process. 

Thus, it is also contemplated that F-box proteins are involved in regulatory 
pathways important for cellular homostasis and/or growth control. In this context, 
F-box proteins may be involved in the elimination or modification of proteins which 
positively or negatively regulate the cell cycle, which positively or negatively regulate 
transcription, or which positively or negatively regulate the abundance of a protein 
involved in a signaling pathway. Elimination of proteins could.be mediated by the 
26S proteosome after targeted ubiquitination by a E3 complex containing, an F-box 
protein. Ubiquitination without proteolytic destruction may alter the activity of the 
target protein either positively or negatively. Thus, it is contemplated that molecules 
that alter the activities or target specificities of F-box proteins, or the ability of F-box 
proteins to enter into macromolecular complexes such as E3 complexes composed of 
and F-box protein, a Cdc53 homolog and Skpl. will find utility as pharmaceutical 
agents for a variety of diseases. The present invention provides the compositions and 
methods for the identification of molecules (including but not limited to proteins, 
peptides, naturally occuring alkaloids, and synthetic alkaloids) which alter the 
activities, levels; or targets of F-box proteins. 

For example, disruption of the F-box protein/Skpl complex is achieved using 
synthetic molecules, proteins, or peptides which mimic the F-box sequence or its three 
dimensional structure and block association of any F-box protein with Skpl. It is 
contemplated that blockage of this interaction renders the F-box protein non- functional 
with respect to ubiquitination of its target proteins. Similarly, disruption of such 
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complexes is also achieved with synthetic molecules, proteins, or peptides which 
specifically bind the F-box of a particular F-box protein. This approach provides 
specificity for a particular pathway involving a specific F-box. These classes of 
molecules can be identified using various methods, including, but not limited to, 
peptide phage display libraries to identify peptide sequences that bind either an F-box 
sequence of a specific domain in Skpl involved in interaction with the F-box. In this 
method, F-box sequences or Skpl sequences are immobilized on solid supports such as 
a magnetic bead through the use of biotinylated F-box or Skpl sequences and 
streptavidin coated magnetic beads. Phage display libraries are then bound to the 
coated magnetic beads and phage binding the beads are isolated and analyzed for 
binding sequences. 

A similar method involves the use of two-hybrid screens to identify proteins or 
fragments of proteins that bind Skpl or the F-box sequence. Such molecules find use 
in blocking assembly of Skpl /F-box protein complexes in vivo and are useful (either 
directly or as precursors) in the generation of pharmacological agents. 

In another embodiment, dis'ruption of F-box/target interactions are . also 
contemplated. In addition to the F-box, F-box containing proteins may also contain an 
additional interaction domain including but not limited to WD-40 or leucine rich 
repeats. For example, Fl Alpha and F2 Beta contain leucine rich repeats. 
Embodiments of the present invention provide methods to identify targets of F-box. 
proteins which include, but are not limited to cyclins. cell cycle regulators, 
cyclin-kinase inhibitors, p-catenin. IkB, and transcriptional regulators. It is 
contemplated that molecules which either block, enhance, or otherwise facilitate 
' association of any target with any F-box protein are useful as pharmaceutical agents in 
the treatment of human diseases. The approaches described herein provide examples 
of approaches that would yield peptides, proteins, and. naturally occuring or synthetic 
molecules which can bind target recognition motifs in F-box proteins or motifs in the 
target protein responsible for recognizing the F-box protein. It is also contemplated 
■ that-molecules-which^bind jhe^t^m^n^^ck^^mplex formation and thereby block. 
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accelerate, or alter the normal function of the F-box protein, which may include 
(depending upon the particular F-box protein), but is not limited to ubiquitination. 

The present invention also provides experimental strategies to determine 
whether molecules identified in these ways can block complex assembly. It is 
contemplated that binding assays based on immobilized Skpl and soluble F-box 
protein (or vice versa), or immobilized F-box protein and soluble target (or vice • 
versa), will be developed in a manner similar to the development of embodiments of 
the present invention (i.e., with Skpl, Cdc4 [an F-box protein] and Sicl [the target of 
Cdc4]).. Molecules to be tested for their ability to alter either Skpl/F-box protein 
interaction or F-box protein/target interaction may be added to binding reactions and 
the effects of the added agent examined by determining the fraction of soluble protein 
bound 'relative' to that bound in the absence of the agent. It. is also contemplated that 
such an assay be adapted to high throughput screening strategies through the use of 
radiolabeled or otherwise tagged soluble binding protein. 

The present invention also provides evidence for phosphorylation specific 
recognition of target proteins and methods for determining whether recognition of the 
target requires that the target be phosphorylated. It is contemplated that agents that 
block or enhance specific phosphorylation of target proteins to allow recognition by 
F-box proteins will be identified through approaches disclosed herein. It is 
contemplated that such agents will find use as pharmaceutical agents that increase or 
decrease the rate of ubiquitination of target proteins. 

In addition, the present invention finds use in the identification and 
development of compounds effective against viral infection and disease. For example, 
two viral proteins (adenovirus E3-12.9K and baculovirus ORFll ) r appear to essentially 
encode only an F-box, and a S£P/-related gene is present in Chorella virus. As 
viruses subvert the cell cycle in order to replicate, it is contemplated that disruption of 
the ubiquitin-mediated proteolysis pathway would also disrupt viral replication. It is 
possible that F-box containing viruses can inhibit degradation of specific protein 
subsets {e.g., cyclins) to enhance their replication, or promote the degradation of 
specific inhibitory proteins. It is also possible that these proteins may target the 
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destruction of proteins that inhibit or kill the virus. The present invention finds use in- 
development of compositions and methods to inhibit viral replication by interfering 
with the ubiquitin-mediated proteolysis pathway utilized by the virus, as well as by 
upregulating the cellular machinery to enhance proteolysis of viral components. In 
particular, the present invention finds use in identification and development of 
compounds effective against immunodeficiency viruses (e.g., human immunodeficiency 
virus, as well as other viruses such as feline immunodeficiency virus, bovine 
immunodeficiency virus, and simian immunodeficiency virus). 

It is further contemplated that targets of novel human F-box proteins will be 
determined by those experienced in the art by approaches including, but not limited to 
two-hybrid library screens, immunoprecipitation analysis followed by immunoblotting 
with antibodies against candidate targets, peptide mapping, mass spectral analysis, 
peptide sequencing, and/or by screening lambda based expression libraries with F-box 
protein probes. 

For example, the present invention provides an example whereby a novel E3 
ubiquitin ligase complex has. been identified using the methods and compositions 
described herein. In 'particular, the F-box protein slimb (TRCP), was found to 
associate with IkB, providing the potential to screen for factors that regulate the NF- 
kB pathway. This has important implications in the regulation and control of cancer 

c 

and the immune system, among other important physiological effects. 

The present invention also finds use in investigating the function and methods 
of altering protein targets whose abundance is altered in disease. For example, cyclins 
are frequently overexpressed in cancer cells. Thus, mutations in F-box proteins 
involved- in cyclin destruction will lead to cy.clin accumulation; such cyclin 
accumulation may promote inappropriate cell division characteristic of cancer. The 
present invention also finds utility in the identification of mutations in F-box genes 
through various methods, including, but not limited sequence analysis. Southern blot- 
analysis of DNA, etc. Furthermore, the present invention also finds use in assessing 
-alterationsJn_cellulaiii3rotem abundanc e due to o verexpression of particular F-box 
proteins. It is contemplated that such alterations are associated with particular 
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diseases. The present invention also finds use in determination of overexpression 
caused by gene amplification in DNA samples from diseased tissue or individuals 
through such methods as Southern analysis using a particular F-box gene as probe. 

Furthermore, the present invention thereby provides methods and compositions 
for the detection and analysis of abnormalities in proteolytic functions, as well as 
methods and compositions for the development of compounds suitable for use in 
ameliorating such abnormalities. 



DETAILED DESCRJPTION OF THE INVENTION 

As discussed above, the present invention provides compositions and methods 
for uene identification and characterization, as well as drug discovery and assessment. 
In particular, the present invention provides components of an E3 complex involved in 
ubiquitination of cell cycle regulators and other proteins, as well as members of a class 
of proteins that directly function in recognition of ubiquitination targets (/.e.. F-box 
proteins). These compositions are involved in protein "degradation pathways associated 
with the eukaryotic cell cycle. 

Assembly of a Complex Containing Cdc53/Skpl/Cdc4 and the E2 Cdc34 
Strong genetic evidence implicated Cdc34, Cdc53, Skpl. and Cdc4 as. 
molecules involved in the control of S-phase entry through destruction of Sicl. In 
preliminary work, SKPi and CDC4 were found to show reciprocal overproduction 
suppression of their respective temperature sensitive mutants and that Cdc4 physically 
associated with Skpl . A further search. for suppressors using a GA L-driven cDNA 
library revealed that CDC53 overexpression suppresses skpl -1 1. These observations, 
coupled with genetic and physical evidence of a Cdc53/Cdc34 interaction resulted in 
the development of embodiments of the present invention. . 

The first step in assembling the complexes of interest involved co-infection of 
-insect-cells-with-various b aculo virus expressjon^vectors- Insect cells were co- infected 
with various combinations of baculoviruses expressing Myc-tagged Cdc53 (Cdc53 M ), 
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Cdc34, Cdc4, and Skpl. Anti-Myc immune complexes from lysates of these infected- 
cells were immunoblotted to identify associated -proteins (See, Figure 1A). As shown 
in Figure IA, in the presence of all four proteins, anti-Cdc53 M complexes contained 
Cdc4 ? Cdc34, and Skpl (Figure 1A, lane 5). However, in the absence of Skpl, only 
5 low levels of Cdc4 were found to bind with Cdc53 M , regardless of the presence of 

Cdc34 (See, Figure 1A, lanes 7 and 8). To confirm this result, the association of 
Cdc53 M with anti-Cdc4 F immune complexes was analyzed. These results indicated the 
association of Cdc53 M with anti-Cdc4 F immune complexes was also greatly enhanced 
in the presence of Skpl (See, Figure IB). Thus, one function of Skpl is to facilitate 

10 association of Cdc53 with Cdc4. In contrast to Cdc4 ? both Skpl and Cdc34 were 

shown to associate with Cdc53 M in the absence of other yeast proteins (See, Figure 
1A). Furthermore, it appeared that Cdc53 can simultaneously associate with both 
Cdc34 and Skpl. as the association of Gst-Skpl with Cdc34 is enhanced in the 
presence of Cdc53 M (See, Figure 1C). These data indicated that Cdc34. Cdc53, Skpl, 

15 and Cdc4 form a multiprotein complex. 

Phosphorylation of Sicl by CIn/Cdc28 is Required for its Recognition 
by a Cdc4/Skpl/Cdc53 Complex 

While previous studies implicated involvement of Cln/Cdc28-dependent 
phosphorylation in Sicl degradation (Schwob et ai. Cell 79:233-244 [1994]; Schneider 

20 et ai. Science 272:560-562 [1996]; and Tyers, Proc. Natl. Acad. Sci. U.S.A. 93:7772- 

7776 [1996]), until the development of the present invention, it was not clear whether 
Sicl was directly phosphorylated by Cln/Cdc28. or whether this phosphorylation was 
correlative or causative for subsequent Sicl degradation (and if causative, whether this, 
modification played a role in Sicl recognition by the ubiquitination machinery). Nor 

25 was it known whether Cln/Cdc28 might also directly regulate the activity of the 

ubiquitination machinery. Once the methods to generate and purify Clnl/Gst-Cdc28 
and Sicl/Clb5/Gst-Cdc28 complexes from insect cells were established in vitro during 

. the development of the present invention, the determination was made as to wheth er ^ 

- 52 - 



WO 99/18989 



PCT/US98/21763 



any of these components might function in Sicl recognition, and if Sic 1 
phosphorylation plays a role in this process. This aspect of the present invention finds 
use in providing methods for the development of drugs or other compounds suitable 
for prevention and/or treatment of cancers (/.^-uncontrolled cellular growth), as well 
as treatment of other diseases associated with abnormalities in cell cycle control. 

In order to accomplish this, Sicl was. purified to near homogeneity from insect 
cells by virtue of its association with Cib5/Gst-Cdc28 complexes {See. Figure -2B). 
Initially, it was believed that such a complex would represent the primary form. of Sicl 
ubiquitinated in vivo. However, it was found that uninhibited CIb5/Cdc28 in these 
preparations phosphorylated Sicl. making it impossible to directly assess the role of 
. specific phosphorylation by Clns. Therefore, a kinase-impaired Gst-Cdc28f K-) 
containing a mutation in a critical catalytic residue (D145N) was used to assemble Sicl 
complexes. In such complexes, Sicl remains essentially unphosphorylated. however 
the Sicl is readily phosphorylated by Clnl/Cdc28 (See. Figure 2 A). In vitro 
phosphorylation of Sic resulted in a reduction in its electrophoretic mobility (See, 
Figure 2B) ; reminiscent of that observed with Sicl in vivo. 

In the absence of Clnl kinase, the extent of Sicl phosphorylation was found to 
be less than 2%. of that of phosphorylated Sicl, but this modification did not result in 
alterations in electrophoretic mobility. For simplicity, this weakly phosphorylated 
form of Sicl is herein referred to as "unphosphorylated Sicl." 

Cdc4 is the Specificity- Factor for Recognition of Phosphorylated Sicl 

Phosphorylated and unphosphorylated Sicl were used in binding reactions with 

- anti=-Cdc5 3 - immune complexes assembled -and -purified -from insect-cells_(See, Figure. 

2C). Phosphorylated Sicl was found to efficiently associate with Cdc53/Skpl/Cdc4 
complexes; this association was dependent upon the presence of Skpl {See. Figure 2C ? 
lanes 6 and 8). Typically 10-20% of the input phosphorylated Sicl was bound at 
about 20 nM Sicl. In contrast, the extent of binding of unphosphorylated Sicl (See, 
Eieure-2C.-lane_7J)_was_compara_ble to that observed in control immune complexes 
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generated from uninfected cells (See, Figure 2C, lane 3), and was less than 1% of the- 
input Sicl. It was also observed that, consistent with the results in Figure 1, the level 
of Cdc4 found in immune complexes lacking Skpl were more than 10-fold lower than 
that found in the presence of Skpl. Thus, Cdc4 and/or Skpl function as binding 
factors for Sicl, and association of Sicl with this complex requires phosphorylation by 
Clnl/Cdc28. 

In addition, to directly examine the roles of Skpl and Cdc4 in Sicl recognition, 
binding experiments were performed using series of complexes assembled in vivo that 
contained constant high levels of Flag-tagged Skpl (Skpl F ), and increasing quantities 
of Cdc4. These experiments, as described in the Examples, showed that association of 
phosphorylated Sicl with anti-Skpl F immune complexes was absolutely dependent 
upon the presence of Cdc4 (e.g., compare lanes 3 and 9 of Figure 2E). Moreover, 
deleting the last three WD : 40 repeats from the C-terminus of Cdc4 (Cdc4AWD) 
abolished its ability to associate with phosphorylated Sicl (See, Figure 2E ? lanes 10- 
16). Therefore, these experiments indicated that Cdc4 functions as the specificity 
factor for binding of phosphorylated Sicl.. and the Cdc4-Sicl interaction requires an 
intact WD-40 repeat domain in Cdc4. While Skpl alone does not interact with Sicl, it 
stimulates association of Sicl with with Flag-Cdc4 (Cdc4 F ) by about 5-fold (See, 
Figure 2D). The weak association of Sicl with Cdc4 alone (See, Figure 2D, lane 3) 
may reflect the participation of an insect cell Skpl homolog. Although it is not clear 
if Skpl physically contacts Sicl or stabilizes a form of Cdc4 compatible with Sicl 
binding, and such an understanding is not necessary in order to use the present 
invention, these results clearly demonstrated that there is a positive contribution of 
_Skp.l_ia.the Cdc4/Sic_l_ jnteractLoa 

Sicl is Ubiquitinated In Vivo 

While the finding that Cdc4, Skpl, and Cdc53 form a complex that binds both 
phosphorylated Sicl and the E2 Cdc34 was consistent with a role for ubiquitination in 
the regulation of Sicl abundance, prior to the development of the present invention, 
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Sicl had not been demonstrated to be ubiquitinated in vivo. In order to directly 
accomplish this, insect cell lysates were generated from either wild type cells or sic! 
deletion mutants expressing His^Ub** or Ub^ as a negative control, and ubiquitinated 
proteins purified using Nf 2 beads (Willems et al, Cell 86:453-463 [1996]) prior to 
immunoblotting with anti-Sicl antibodies {See, Figure 3 A). 

The K48R mutation in Ub** blocks poiyubiquitiriation and therefore recognition 
by the proteolytic machinery (i.e., proteosome recognition) (Chau et ai 7 Science 
243:1576-1583 [1989]), while the G76A mutation reduces the rate at which hydrolases 
remove ubiquitin conjugates (Hodgins et ai, J. Biol. Chem., 267:8807-8812 [1992]). 
A ladder of bands recognizable by anti-Sicl antibodies was detected in the Ni +2 -bead 
bound proteins from wild type lysates expressing His 6 -Ub^ (See, Figure 3A. lane 8) 
but not in conjugates derived from Ub^-expressing cells or a sic\ deletion strain (See. 
Figure 3 A, lanes 5 and 6).. This result demonstrates that Sicl is ubiquitinated in vivo. 
Thus, the present invention also provides an important therapeutic target for 
development of drugs and other compounds for disease prevention and/or treatment. 

Reconstitution of the Sicl Ubiquitination Pathway Using Purified Proteins 

Once a strategy to generate Cdc4/Skpl/Cdc53 complexes that recognized 
phosphorylated Sicl was developed, the next step was to determine whether these 
complexes can catalyze ubiquitination of Sicl in vitro when supplemented with Cdc34 ? 
El. ATP, and ubiquitin. It was observed that in the presence of all reaction 
components, phosphorylated Sicl in complexes with Clb5/Cdc28 was efficiently 
convened to higher molecular weight conjugates detectable with anti-Sicl antibodies 
(See, Figure 5 B, Jane 6; and Figure 5C, lane 5). In contrast, unphosphorylated Sic 1 
was not detectably ubiquitinated. Sicl ubiquitination absolutely required Cdc34, Cdc4, 
Cdc53, SkpL El and ubiquitin (See e.g. Figure 5B and Figure 5C). as well as yeast 
Skpl. The pattern of high molecular weight Sicl conjugates obtained in. reactions with 
ubiquitin was different from that observed when Gst-Ub^ was used as the ubiquitin 
source,. (See, Figure 5C. compare lanes 5 and 1 1) confirming that the hig h molecular 
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weight forms observed were products of ubiquitination. With Gst-Ub^, the Sicl 
reaction products were integrated into a ladder of bands differing by approximately 35 
kDa, the size of Gst-Ub^ (See, Figure 3C, lane 11). Since Gst-Ub^ had a reduced 
ability to form polyubiquitin chains, the number of bands observed is likely to reflect 
the number of individual lysines ubiquitinated on a single Sicl molecule. The 
ubiquitination reaction was time-dependent and the reaction efficiency ranged from 10- 
40% of the input Sicl protein (See e.g., Figure 3B and 3C). When the reaction was 
performed with pre-bound Sicl, the efficiency was greater than 50%. In addition, it 
was found that greater than 50% of the Sicl ubiquitin conjugates formed after 60 
minutes had dissociated from the Cdc4/Skpl/Cdc53 complex. Neither Gst-Cdc28, 
Clb5. Cdc-53, Skpl, or Cdc4 formed ubiquitin conjugates under the reaction conditions 
employed, although *Cdc34 was ubiquitinated as previously reported. 

To test whether Sicl, ubiquitination requires association with Clb5/Cdc28 
complexes, ubiquitination reactions using Sicl produced in bacteria were performed 
both with and without phosphorylation with Cln2/Cdc28 {See. Figure 3D). As in the 
case of Sicl assembledjn insect cells with Clb5/Cdc28 1 phosphoryiated Sicl from 
bacteria was efficiently ubiquitinated with greater than 90% of the Sicl forming 
ubiquitin conjugates (See, Figure 3D, lane 8), and ubiquitination absolutely required 
Sicl phosphorylation (i.e., unphosphorylated Sicl was not ubiquitinated; See e.g., 
Figure 3D,- lane 4). Thus, phosphorylation of Sicl was shown to be required for its 
recognition by Cdc4 and Skpl. 

Next, it was determined whether Cln/Cdc28. present in small amounts in the 
ubiquitination reaction, is also required for additional steps in the ubiquitination 
process (e\g, to phosphorylate the ubiquitination machinery). This was accomplished 
by treating bacterial Sicl with CIn2/Gst-Cdc28 complexes immobilized on GSH- 
Sepharose beads, removing the complexes from the beads prior to use in ubiquitination 
reactions, and determining whether the complexes were free of soluble kinase by 
immunoblotting with anti-HA antibodies (See ? Figure 3D, lane 3). These results 
Hndi^arprf^hat-^ic-Uphosphor-y-lated in thi s manner was also ef ficient l^i^intjnated^^ 
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(See, Fisure 3D, lane 9). Thus, these data indicated that Sicl phosphorylation 
constitutes the primary requirement of Cln/Cdc28 kinases in Sicl ubiquitination in the 
in vitro reaction. 

Although Sicl was found to be an inhibitor of Cdc28/Clb5 complexes, when 
the kinase complex contained an excess of Sicl, it was incapable of phosphorylating 
Sicl and converting it into a substrate for ubiquitination (Figure 3E shows the reduced 
electrophoretic mobility) and 32 P incorporation. This Clb5/Cdc28-phosphorylated Sicl 
was also a substrate for ubiquitination (See, Figure 3E). Although it is not necessary 
to understand the mechanisms involved in order to use the present invention, 
overexpression of CLB5 can drive S-phase entry in cln- cells and suggests that active 
Clb5/Cdc28 formed during Sicl destruction may collaborate with Cln/Cdc28 to 
complete the Sicl ubiquitination process. 

F-box Proteins are Receptors for Ubiquitination Substrates 

The determination that Cdc4 functions in the recognition and ubiquitination .of 
phosphorylated SicUs consistent with a Junction of F-box proteins being recognition 
of ubiquitination targets. During the development of the present invention, 
investigations into whether specific F-box proteins could have broad specificity and 
interact with multiple targets, or could be relatively restricted in their target specificity, 
perhaps associating with only a single target, were conducted. 

To elucidate the selectivity of F-box proteins, experiments were conducted to 
determine whether substitution of Cdc4 by another F-box protein (Grrl) could support 
Sicl binding and ubiquitination. Grrl has an F-box near its N-terminus and can 
interact simultaneously with-Skpl and Cdc53 when co-expressed in insect cells. Gene 
10-taesed Grrl (Grrl 10 ) was also found to interact simultaneously with Skpl and 
Cdc53 : when co-expressed in insect cells (See. Figure 4A). It was found that Grrl and 
Cdc4 interact with Skpl/Cdc53 in a mutually exclusive manner. In contrast with 
Cdc4, however, the Grrl/Cdc53 interaction in insect cells was not enhanced by co- 
-expression,oLSkpl,_ah^ with t hese comp lexes. 
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Importantly, Grrl assembled with Cdc53/Skpl {i.e., Cdc53/Skpl/Grrl complex) 
was unable to associate with phosphorylated Sicl and did not support ubiquitination of 
phosphorylated Sicl complexes in the in vitro system with purified proteins under 
conditions where Cdc4 readily facilitates Sicl binding and ubiquitination- (Sec, Figures 
4B and C). Therefore, the F-box proteins of some embodiments of the present 
invention display selectivity toward particular targets. 

Recognition of Phosphorylated Clnl and CIn2 by Grrl 

Previous studies have shown that mutations of potential Cdc28 phosphorylation 
sites in the C-terminal PEST domain in Cln2 increase its stability in vivo (Lanker et 
ai. Science 273:1597-1601 [1996]), and that only the phosphorylated form of Cln2 is 
associated with Cdc53 in vivo (Willems et aL, [1996], supra), implicating this 
interaction in the Cln destruction pathway. Cdc28 is required for Cln phosphorylation 
although it has not been determined that the requisite phosphorylation reflects 
autophosphorylation or phosphorylation by a distinct protein kinase. The finding that 
Sicl is recognized by the F-box protein Cdc4, together with a. genetic requirement for 
the F-box protein Grrl in Cln destruction, led to the next step in the development of 
the present invention, namely the examination of whether Grrl functions in recognition 
of phosphorylated Clns. 

To generate Cln proteins for binding reactions. Cln/Gst-Cdc28 complexes were 
isolated from insect cells. In the presence of ATP. both Clnl and Cln2 were found to 
be autophosphorylated, a modification that reduces their electrophoretic mobility (see 
below). To examine whether Grrl can associate with phosphorylated Clns and to 

compare the extent -of selecti vity- of Grr-1 - and -Cdc4-toward Cln binding. anti-Skp l F _ 

immune complexes from cells co-expressing Grrl or Cdc4 in the presence or absence 

of Cdc53 were used in binding reactions with 32 P-labeled Clnl or Cln2 kinase 

complexes. :o P-labeled Sicl was used as a control for Cdc4 binding. Both Clnl and 

Cln2 complexes were found to associate with Grrl/Skpl F /Cdc53 complexes {See, 

Figure 5 A) with an effic i enc y of about 40% of the input Clnl or Cln2 {See, Figure 
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5 A, lanes 8 and 12) and this association did not require Cdc53 (lane 16). In contrast, - 
about 6% of the input Cln proteins associated with Cdc4/Skpl F complexes independent 
of the presence of Cdc53 (lanes 7, 1 1, and 15), compared with 1% association in the 
absence of an F-box protein (lanes 6, 10, 14). The extent of selectivity of .these F-box 
5 proteins for Cln and Sicl was further reflected by the observation that Cln I protein 

present in the phosphorylated Sicl preparation was selectively enriched in Grrl 
complexes (Figure 5A : lane 4). The presence of all proteins in the binding reaction 
was confirmed by immunoblotting (Figure 5B) and the quantities of Cdc4 and Grrl 
were comparable, based on Coomassie staining of SDS gels of immune complexes. 
10 Thus, Grrl and Cdc4 display specificity toward physiological substrates. 

Clnl Phosphorylation is Required for Recognition by Grrl 

If Cln phosphorylation is required for ubiquitination as suggested by genetic 
studies (Lanker et al., [1996], supra; and Witlem et ai 7 [1996], supra), and if Grrl is 
the receptor for Clns, then the Grrl/Cln interaction would be expected to be 

15 phosphorylation dependent. Thus, the next step in the development of the present 

invention was to examine Grrl alone and in complexes with Skpl or Skpl/Cdc53. 
Thus, Grrl alone, or in complexes with Skpl or Skpl/Cdc53 was immunoprecipitated 
from insect cell lysates and used in binding assays with phosphorylated or 
unphosphorylated Clnl complexes (Figure 5C). 

20 Unphosphorylated Clnl was produced in insect cells as a complex with kinase 

deficient Gst-Cdc28(K-), which minimized Clnl autophosphorylation during expression 
and allowed the role of phosphorylation to be tested. As isolated, this Clnl protein 
- migrated as a homogeneous species of approximately 66 kDa (Figure 5C ; lane 1). In 

contrast, phosphorylated Clnl (lane 2) undergoes a dramatic mobility shift to 

25 approximately 80 kDa, consistent with the results observed in vivo. Phosphorylated 

Clnl (and its associated Cdc28 protein) efficiently associated with all Grrl complexes 
(Figure 5C, lanes 6, 8, 10), but was absent from control binding reactions lacking Grrl 
(Fifiure 5C, la ne 4 ). In contrast, t h e levels of unphosp horylated Clnl associated with 
Grrl complexes were compared to that found in binding reactions lacking Grrl (Figure 
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5C, lanes 3, 5, 7, 9). Thus, association of both Clnl with Grrl and Sicl with Cdc4 is 
greatly enhanced by phosphorylation. Although the Grrl/Skpl/Cdc53 complex is 
capable of binding efficiently to phosphorylated Clnl, it was not competent for Clnl 
ubiquitination when supplemented with Cdc34 and El (Figure 5D). Moreover, Cdc4 
complexes that functioned in Sicl ubiquitination also failed to catalyze ubiquitination 
of Clnl (Figure 5D), despite the fact that Clnl can associate, albeit weakly, with Cdc4 
(Figure 5 A). In contrast, identical preparations of phosphorylated Clnl protein were 
efficiently ubiquitinated in partially purified yeast lysates in a Cdc34-dependent 
manner (See e.g t Figure 5E), indicating that this preparation of Clnl is competent for 
ubiquitination. Although an understanding of the mechanism is not necessary in order 
to use the present invention, the absence of Clnl ubiquitination in the purified system 
may reflect the requirement of additional factors or modifications. 

F-box Proteins as Receptors for Ubiquitination Targets 

The present invention contemplates that a large number of proteins contain the 
F-box ? and are thereby implicated in the ubiquitin pathway. The development of the 
present invention has revealed that F-box proteins directly contact ubiquitination 
substrates and can display selectivity in recognition of potential targets for 
ubiquitination. as would be expected of E3 proteins. For example, both Grrl and 
Cdc4 assemble into mutually exclusive complexes with Cdc53 and Skpl (Figure 4), 
However, Grrl does not associate with Sicl, nor does it support Sicl ubiquitination. 
In contrast, it was found that Cln proteins efficiently associate with Grrl/Skpl F 
complexes and with Cdc4/Skpl F (although less efficiently) (See e.g.. Figure 5). 
Although Cdc53 was originally isolated as a Cln2-interacting protein (Willems et ^ 
[1996], supra), the present invention provides evidence that this original interaction 
was bridged by Grrl and possibly Cdc4. The Grrl/Cln interaction is of interest in 
view of the fact that GRRL CDC53, and SKP1 are required for destruction of Cln 
proteins, and suggests that Grrl functions as a component of an E3 for Cln 
ubiquitination. The absence of Cln ubiquitination by purified Grrl complexes is likely 
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to indicate the absence of an essential factor(s) or modifications that are not required - 
for Sicl ubiquitination in vitro, and provides evidence that CIn ubiquitination may be 
more complex than is Sicl ubiquitination. Nonetheless, the present invention provides 
methods, compositions, and models for the development of compounds that interact 
5 with the ubiquitination process, and thereby affect protein degradation through any 

number of routes. 

Despite, the observation that F-box proteins may show selectivity towards 
potential substrates, it is unlikely that F-box proteins will be monospecific. For 
example, in S. pombe, recent genetic data have linked the CDC4 homolog pop + with 

10 the ubiquitination of both the CK1 Ruml and Cdcl8, a regulator of DNA replication 

(Kominami and Toda, Genes Dev., 11:1548-1560 [1997]). In budding yeast, CDC4 
has also been implicated in destruction of the CdclS homolog Cdc6 (Piatti er aL % Genes 
Dev., 10:1516-1531 [1 996]),^ indicating that it too has multiple targets. It was also 
determined that Cdc4 can associate with Clns, albeit less efficiently than with Grrl 

15 (Figure 5). Of importance is the fact that all of the targets of F-box protein mediated 

destruction identified to date are central regulators of key events in the cell including 
DNA replication, cell cycle progression, and nutritional sensing. 

A Cdc53/Cdc4/Skpl E3 Complex is Required for Sicl Ubiquitination by Cdc34 

Sicl destruction is genetically dependent upon Cdc34, Cdc4, Cdc53. and Skpl. 
20 During the development of the present invention, it was determined that these proteins 

are directly involved in the ubiquitination process. As Cdc53 can simultaneously bind 
the E2 Cdc34 and Skpl. it frictions as an adapter linking the Skpl /F-box protein 
complex to E2s (Figure l)._In_turn, Skpl has the ability tojink Cdc4 to Cdc53. Cdc4 

binds both Skpl and the ubiquitination substrate SicL The interaction of Cdc4 with 

r 

25 Skpl was shown to involve the F-box located in the N-terminus of Cdc4, while the 

interaction with Sicl involves Cdc4 , s C-terminal WD-40 repeats (Figure 2). Skpl was 
also shown to be involved in substrate recognition because it enhances the association 
of Cdc4 with phosphorylated Sicl. Cdc4 was shown to act as a receptor that, in 
conjunction with Skpl. recruits substrates to the ubiquitination complex. It is 
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contemplated that any of these proteins could also have carrier roles in the transfer of - 
ubiquitin like E6AP (See e.g., Scheffner et al., Cell 75:495-505 [1995]). However, it 
was determined that mutation of the only conserved cysteine in Skpl or all 6 cysteines 
in Cdc53 did not impair complementation of skpl or cdc53 null mutations, 
respectively, indicating that these two proteins are unlikely to transfer ubiquitin by a 
thio-ester intermediate. 

Phosphorylation Directly Regulates Association of SicI and Cln Proteins with E3s 

A central feature in the recognition of Sicl and Cln by F-bdx proteins is the 
phosphorylation dependent nature of the interaction. Association of Sicl with Cdc4- 
containing complexes and subsequent ubiquitination requires Sicl phosphorylation, as 
shown in Figures 2 and 3. It was also shown that Sicl phosphorylated by excess 
Clb5/Cdc28 kinase can be ubiquitinated in vitro (See, Figure 3E). It is contemplated 
that the initial generation of Clb5/Cdc28 activity at the Gl/S transition could 
potentially accelerate Sicl destruction facilitating the sharp and unidirectional change . 
of state characteristic of cell cycle transitions. 

Similarly, association of Grrl with Cln proteins is greatly enhanced by 
phosphorylation, as indicated in Figure 5. Phosphorylation of specific residues in the 
C-terminal PEST domain of Clnl is required for Cln2 instability (Lanker el al., 
[1996], supra), and phosphorylated Cln2 is found in complexes with Cdc53 in vivo 
(Willems et al, [1996], supra). The present invention shows that Cln/Cdc28 can 
provide a system that functions in vitro. The present invention also provides methods, 
compositions, and models for the determination of whether Cln ubiquitination is 
activated by_ autophosphorylation in trans, as the accumulation of active Cln/Cdc28 
complexes may be required to achieve sufficient Cln phosphorylation to promote its 
destruction. 

While regulating the association of F-box proteins through substrate 
phosphorylation is an effective method controlling the timing of ubiquitination, it is 
not necessaril y the case that all F-box proteins will recognize their substrates in a 



- 62 - 



WO 99/18989 



PCT/US98/21763 



phosphorylation dependent manner. Observations made during the development of the * 
present invention indicate that WD-40 and LRU containing F-box proteins can interact 
with phosphorylated substrates, but approximately half of the known F-box proteins do 
not have obvious protein interaction motifs. Nonetheless, the present invention 
provides methods, compositions, and models to determine whether the interaction of 
these proteins with their targets is regulated by phosphorylation or even involves 
ubiquitination.- "The timing of ubiquitination could be controlled by mechanisms 
unrelated to substrate phosphorylation, such as controlled accessibility of substrates or 
regulated expression, localization, or modification of the F-box protein, thus providing 
methods for development of compounds that affect proteolysis. 

While the abundance of Cdc4 is not cell cycle regulated, the F-box protein 
Skp2 displays cell cycle-regulated mRNA abundance which peaks' in S-phase ? 
consistent with its association with cyclin A during that phase of the cycle (Zhang et 
aL, [1995], supra). In vivo, association of Grrl and Skpl is enhanced in the presence 
of glucose in a post-translational mechanism. 

A large number of proteins contain PEST sequences and in a subset of these 
proteins, these sequences have been shown to be phosphorylated and to mediate 
instability. The development of one embodiment of the present invention focused on 
the role of Skpl and F-box proteins in assembly of a ubiquitination complexes that 
recognizes specific phosphorylated proteins. While the particular complex' defined by 
this embodiment of the present invention is unlikely to be responsible for recognition 
of all PEST-dependent proteolysis substrates, this complex is likely to be the prototype 
for a diverse set of complexes in higher eukaryotes. Five CDC53 homologs have been 
identified in mammals (Cull-5; Kipreos et aL, Cell 85:829-839[1996]) : approximately 
15 E2-related genes exist in S. cerevisiae alone, several dozen F-box containing 
proteins have been identified in several species, and several SKP1 related genes exist 
in C elegans and are likely to exist in mammals as well. It is clear that the present 
invention provides methods, compositions, and models to identify PEST-dependent 
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proteolysis substrates in these and other organisms, as well as providing the flexibility . 
to differentially regulate the ubiquitination of a very large number of substrates. 

Other Applications 

In addition, various embodiments of the present invention find use in other 
settings. For example, the methods, compositions, and models of the present 
invention provide the tools to determine the function of such proteins as eiongin C, a 
Skpl-related protein is part of a complex containing the Cdc53-related protein Cul2, 
the von Hippel-Lindau (VHL) tumor suppressor protein, eiongin B, and eiongin A, a 
protein that is also found in association with eiongin C, and contains an. F-box. Thus, 
the present invention provides the means to develop compounds that affect systems 
other than ubiquitination-mediated proteolysis. 

Indeed, the F-box-directed ES complex (FEC) embodiment described in detail 
herein, represents one example of a pathway through which protein kinases control the 
stability of target proteins. In view of the large number of protein kinases and 
possible FECs, this pathway may be second only to transcriptional regulation in the 
control of protein abundance. While the specific examples described herein focus on 
the concern the cell cycle, the present invention provides methods, compositions and 
models applicable to other, diverse regulatory systems. 

Although an understanding of the mechanism is not necessary in order to use 
the present invention, Figure 6A provides a model in which a protein kinase 
phosphorylates target proteins, thus activating them for association with their receptors, 
the F-box proteins. Although some F-box proteins may already be associated with a 
Skpl/Cdc53 complex prior to association with substrates, as shown in Figure 6 A, it is 
also possible that F-box proteins exist in a unbound form, and that association of the 
F-box protein with the substrate drives association with Skpl/Cdc53. Since Skpl 
enhances the association of Cdc4 with Sicl, depending on the relative Kd values for 
individual interactions and concentrations of the constituents, association of the target 
with an F-box protein may enhance association with Skpl. Once the ubiquitination 
complex is formed and polyubiquitination takes place with the assistance of El and E2 

- 64 - 



WO 99/18989 



PCT/US98/21763 



proteins, the substrate is then released and recognized to the 26S proteosome where it . 
is proteolyzed. 

As indicated in Figure 6B, it is contemplated that other combinations of FEC 
(or "SCF") complexes exist in cells. For example, the F-box protein Met30 is closely 
related to Cdc4, and is required for repression of genes in the methionine biosynthetic 
pathway in the presence of S-adenosylmethionine (AdoMet) (See, Thomas et ai, Mol 
Cell. Biol., 15:6526-6534 [1995]). Met30 forms a complex with Met4, a transcription 
factor required for methionine biosynthetic gene expression. The present invention 
provides the means to determine whether Met4 is ubiquitinated in response to 
adomethionine. Furthermore, although the primary embodiment of the present 
invention has focused on Cdc34, the present invention provides means to determine 
whether other E2s are capable of functioning in the context of FECs. 

Also, as shown in Figure 6C, SCF complexes (i.e., Skpl, Cdc53. and Cdc4 
present in a multiprotein complex), work together with protein kinase signalling 
pathways to control protein abundance. Figure 6C illustrates one such pathway, in 
which SCF pathways function multiple times in the transition from Gl to S phase in S. 
cerevisiae. 

Like protein synthesis, protein destruction is a fundamental mechanism used by 
organisms to manipulate their function. In one embodiment, the present invention 
provides the composition of an E3 complex, FEC, involved in selection of 
ubiquitination substrates. Because the constituents of this complex are members of 
protein families, the present invention provides the prototype for a large class of E3s 
formed by combinatorial interactions of related family members as indicated in Figure 
6B. The identification of F-box proteins as the receptor components of this ubiquitin 
ligase provides the means for identification of the key regulatory molecules controlled 
by ubiquitin-mediated proteolysis. Thus, the present invention provides means for the 
- elucidation of the biochemistry of this general ubiquitination pathway is likely to have 
important ramifications for many aspects of biology including cell proliferation, 
development, and differentiation. 
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The Present Invention in Action * 

The following example is provided to illustrate one specific application of the 
present invention. In this example, the methods and compositions of the present 
invention are used to identify a novel E3 ubiquitin ligase complex that finds use in 
such applications as the ubiquitination of IkB, which has direct impact on the 
regulation of NF-kB activity and associated cellular pathways. The findings of these 
studies provide new therapeutic targets for the NF-kB pathway that can diversify the 
existing programs for drug development. 

The NF-kB pathway has many important physiological roles and has become 
the focus of intense interest as a target for drug development. For example, the NF- 
kB pathway has been implicated in regulation of apoptosis. Hallmarks of transformed 
cells include the ability to proliferate with reduced growth factor levels and defects in 
the ability to undergo apoptosis. Many cell types contain signaling systems that 
recognize inappropriate proliferation and respond by activating an intrinsic apoptotic 
pathway leading to cell loss. For example, it has been shown that, loss of the Cdk 
inhibitor p57 in the lens leads to both inappropriate proliferation and increased 
apoptosis (Zhang et at., Nature 387:151 [1997]). As such, transformation pathways 
frequently include some process that either inactivates a component of the apoptotic 
machinery, activates a survival pathway, or both. TNF-ct, a pro-inflammatory 
cytokine, functions in part to activate NF-kB, a transcription factor composed of p50 
and p65/Rel subunits (Baeuerle and Baltimore, Cell 87:13 [1996]; Beg et al. ..Mol. 
Cell. Biol. 13:3301 [1993]; DiDonato et aL. Mol. Cell. Biol. 15:1302 [1995]; and 
Tewari and Dixit, Genes Devel. 6:39 [1996]); NF-kB also activates the expression 
of a large number of genes, including growth factors, chemokines, and adhesion 
molecules which mediate inflammatory responses. 

TNF-a has also been shown to induce particular cell types to undergo 
apoptosis, although the cytotoxic effects are revealed most frequently only if 
protein/RNA synthesis is blocked (Tewari and Dixit, supra). Recent studies have 
revealed that the inability of cells to undergo apoptosis in response to TNF-a reflects 
activation of a survival pathway, which is programmed by NF-kB action (Beg and 
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Baltimore, Science 274:782 [1996]; Liu et al. f Cell 87:565 [1996]; Van Antwerp et al. T 
Science 274:787 [1996]; and Wang et aL, Science 274:784 [1996]). Cells lacking 
RelA or blocked for NF-kB nuclear translocation are sensitive to TNF-mediated killing 
(Beg and Baltimore [1996], supra; and Wang et al. y supra). Moreover, induction of 
5 NF-kB activity protects cells against TNF-mediated cell death (Van Antwerp et aL, 

supra). TNF-a may induce cell death through one pathway and simultaneously induce 
a protective mechanism through NF-kB (Beg and Baltimore [1996], supra). These 
studies indicated' for the first time an important role for NF-kB in cell survival 
pathways and suggested inhibition of NF-kB function might be used to predispose 

10 cancer cells to killing by TNF-a or chemotherapeutic compounds. 

In principle, agents that block NF-kB function could inactivate the cell survival 
pathway set in motion by NF-kB, rendering cells capable of undergoing apoptosis. In 
addition to its survival functions, there is evidence that NF-kB may play growth 
promoting roles by activating transcription of myc, which may drive the cell cycle 

15 forward (reviewed by Sovak et aL, L Clin. Invest. 100:2952 [1997]). There is 

accumulating evidence that NF-kB is used to set up a survival pathway in transformed 
mammary cells. Activated nuclear NF-kB is prominent in mammary tumor lines 
(Nakshatri et aL, Mol. Cell. Biol. 17:3629 [1997]), but rare in normal mammary 
epithelial cells, and recent studies indicate blocking NF-kB in this setting can induce 

20 apoptosis (Sovak et aL, supra). Other cell types such as B-cells also undergo 

apoptosis when NF-kB is inhibited (Wu et aL, EMBO J. 15:4682 [1996]). It is 
possible that NF-kB is normally used to protect particular mammary cells from 
apoptosis, which is occurring as part of the normal biology of the system, and that 
transformation takes advantage of this property. In addition, NF-kB activation in 

25 mammary tumor cells correlates with ER-independent proliferation (Nakshatri, et aL, 

supra), suggesting a possible link between estrogen responsiveness and apoptosis. 

For the last several years, there has been interest in the drugs that block NF-kB 
activation for use in anti-inflammatory diseases (See e.g., VogeL Science 281:1943 
[1998]), an interest that has been strengthened by the finding that aspirin functions to 
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block the NF-kB pathway (Grilli et al., Science 274:1383 [1996]). The finding that . 
NF-kB also functions in cell survival has led to the realization that drugs that affect 
this pathway may also be useful in cancer treatment. The insensitivity of some tumor 
cells to chemotherapeutics may reflect an inability to undergo apoptosis and 
interestingly, inhibitors of NF-kB can correct the radiation sensitivity of cells mutant 
in the AT gene (Jung et al., Science 268: 16 19 [1995]). Thus, NF-kB inhibitors may 
find use as an adjunct to chemotherapy/radiotherapy. Thus, it is contemplated that a 
more complete understanding of the NF-kB activation pathway will lead to the 
identification of new therapeutic targets. 

NF-kB activation involves a multi-step signal transduction pathway (Baeuerle 
and Baltimore, supra) involving receptor activation, activation of kinases (IKKa and 
IKK(3) that phosphorylate IkB (the endogenous inhibitor of NF-kB) ; ubiquitination of 
IkB, proteolysis of IkB. and translocation of NF-kB to the nucleus. Recent advances 
include identification of IKKs (DiDonato et al., Nature 388:548 [1997]: Mercuric et 
al., Science 278:860 [1997]; Regnier et al., Cell 90:373 [1997]; Woronicz et ai. 
Science 278:866 [1997]; and Zandi <?/ ai, Nature 387:151 [1997]) and the components 
of the TNF receptor complex (reviewed by Tewari and Dixit, supra). In contrast, 
prior to the present invention, virtually nothing was known about the molecules that 
function in the ubiquitination step. 

As discussed above, the present invention provides a novel E3 ubiquitin hgase 
complex that provides means to identify therapeutic targets for regulating NF-kB 
activity, to identify the molecular determinants that confer the' ability of this ligase to 
recognize phosphorylated IkB, and to identify molecules that can disrupt this 
interaction. ........ 

A. Background regulation of NF-kB function 

NF-kB activity is regulated primarily through its sub-cellular localization 
(Baeuerle and Baltimore, supra). In the absence of signal., NF-kB is sequestered in the 
cytoplasm by interaction with a member of the IkB (inhibitor of kB) family of proteins 



- 68 - 



WO 99/18989 



PCT/US98/21763 



(Baeuerie and Baltimore, Science 242:540 [1988]). IkB binds to p50/p65 heterodimers 
and simultaneously blocks both the nuclear localization signal and the ability of NF-kB 
to bind DNA (Beg et aL, Genes & Devel. 6:1899 [1992]; Luque and Gelinas, Mol. 
Cell. Biol. 18:1213 [1998]; and Thompson et aL, Cell 80:573 [1995]). In response to 
5 stimuli intended to activate NF-kB, IkB is rapidly phosphorylated (Beg et aL, Mot. 

Cell. Biol. 13:3301 [1993]; Brown et aL, Science 267:1485 [1995]; Chen et aL, Genes 
& Devel. 9:1586 [1995]; DiDonato et aL, [1995], supra] Finco et aL, Proc. Natl. 
Acad. Sci. 91:1 1884 [1994]; Lin et aL, Proc. Natl. Acad. Sci. 92:552 [1995]; and Liu 
et aL, Cell 87:565 [1996]). This signals IkB to be destroyed by ubiquitin mediated 
10 proteolysis, allowing NF-kB to translocate to the nucleus to activate target genes 

(Alkaiay etal., Proc : Natl. Acad. Sci. 92:10599; Henkel et aL, Nature 365:182 [1993]; 
and Scherer et aL, Proc. Natl. Acad. Sci. 92:11259 [1995]). The identity and 
regulation of the ubiquitin' ligase that functions in NF-kB ubiquitination was unknown 
in the art. 

15 A key component of this signaling pathway involves activation- of kinases 

responsible for IkB phosphorylation," since this step (i.e.; "IkB phosphorylation) is 
thought to be the rate-limiting step in NF-kB activation. Signaling molecules such as 
TNF, which promoter NF-kB activation in particular cell types, bind to TNF receptors 
that link to the death domain protein TRADD, and TRAF1/2 which contain a TRAF 

20 domain (Tewari and Dixit, supra). These proteins function in the transient activation 

of two kinases, IKKa and IKKP, which are part of a large (700 kd) complex whose 
other components are not yet fully defined (DiDonato et aL. [1997], supra: Mercurio 
et aL, supra: Regnier et aL, Cell 90:373, [1997]; Woronicz et aL. Science 278:866 
_ [1997]; and Zandi et aL, supra). In vitro, each of these ; kinases specifically 

25 phosphorylate IkB on two serine residues (Ser-32 and Ser-36) and this combination of 

phospho-serine residues is thought to be the targeting signal for IkB ubiquitination, 
although how this signal is recognized and utilized is not known in the art 
Importantly, overexpression of non-phosphorylatable IkB has been shown to be 

, effect ive in blocking NF-kB activation (Woronicz et aL, supra). 
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B. Phosphorylation specific association of IkB with Skpl 

The role of SCF complexes in phosphorylation-dependent ubiquitination led to 
the examination of whether IkB might associate with Skpl. HeLa cell lysates were 
incubated with agarose beads (Affigel beads) containing unphosphorylated and 
phosphorylated IkB sequences overlapping the ubiquitination targeting signal 
previously identified in IkB (Yaron et aL, EMBO J. 16:6486 [1997]; containing Ser-32 
and Ser-36) and the presence of Skpl in iKB-associated proteins examined by 
immunoblotting, as shown in Figure 8. The results demonstrated that Skpl 
specifically associated with phosphorylated IkB but not unphosphorylated IkB. Skpl 
is a highly abundant protein and is thought to be distributed among multiple F-box 
proteins and possibly kirietochore complexes. It is estimated that -1% of the Skpl in 
these extracts can associate with IkB in vitro, 

C. slimb F-box protein associates with phosphorylated I/cB 

Having found that Skpl can associate with phosphorylated IkB, a number of F- 
box proteins were surveyed for association with phosphorylated IkB. Various F-box 
proteins were produced by in vitro translation and tested for binding to phospho-lKB 
and IkB. In particular, a variety of in vitro translated F-box proteins containing LRRs 
(Skp2, F alpha), WD40 repeats (MD6, Met30), a cyclin box (cyclin F), and no obvious 
additional domains (F gamma) failed to interact with phosphorylated IkB, as shown in 
Figure 9. In contrast, the slimb protein (also referred to herein as "TRCP protein") 
specifically associated with phosphorylated IkB. suggesting that slimb F-box protein 
plays a role in I kB/NF-kB .regulation. 

To- determine cell. types where slimb might function, asystematic in situ 
hybridization analysis was initiated to determine patterns of slimb expression in adult 
mouse tissues and during development. A section through a El 2.5 day mouse was 
subjected to in situ hybridization 35 S-Iabeled mouse slimb antisense RjNA using 
established procedures (Zhang et aL. supra). Analysis demonstrated that slimb is 
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expressed at maximal levels in the ventricles of the forebrain and hindbrain, lung, and, 
liver. Weaker expression was observed throughout most of the embryo. 

D. slimb/Skpl associates with phosphorylated I/cB 

The finding that Skpl and slimb can both form complexes with phosphorylated 
5 IicB beads, together with the fact that slimb contains an F-box ? led to the examination 

of whether slimb can associate with Skpl and Cull in vivo. Although every F-box 
protein tested to date interacts with Skpl, there are 6 Cul homologs and it is not clear 
at present whether they all bind to Skpl or only a subset bind to Skpl. To examine 
these interactions, 293 T-cells were transfected with vectors expressing various tagged 

10 versions of Skpl, slimb, and cull as shown in Figure 10A and 10B. To assemble the 

SCF/slimb complex, plasmids expressing cull HA, SkplHA-3. and s!imbMYC9 were 
transfected in the indicated combinations (Figure 10A) into 293 cells using lipofection. 
After 48 hours, cells were disrupted in lysis buffer (10 mM Tris-HCl. 0.5% nonidet P- 
40, 150 mM NaCL 10 mM beta-glycerolphosphate) and insoluble material removed by 

15 centrifugation. Lysates (1 mg of protein) were subjected to immunoprecipitation using 

anti-myc antibodies. Immune complexes were washed three times in lysis buffer and 
were separated by SDS-PAGE and transferred to nitrocellulose. Blots were developed 
using anti-HA, anti-SkpL and anti-myc antibodies. Figure 10B shows lysates from the 
indicated transfections that were subjected to immunoblotting using the indicated 

20 antibodies. 

Additionally immunoprecipitation/Western blotting experiments were 
performed as shown in Figure 1.1 A and 1 IB. In Figure 1 1 A. the indicated plasmids 
were^ transfected into 293T cells and after 48 hours, lysates were made and subjected 
to immunoprecipitation using anti-HA antibodies to precipitate Cul 1 . The presence of 

25 slimb and Skpl were determined using anti-myc and anti-Skpl antibodies. The myc9- 

tagged Skpl migrates at approximately 30 kilodaltons compared to 19 kd for untagged 
Skpl. 

The data in Figures TO and 1 1 demonstrate that Cull immunocomplexes contain 
Skpl, as expected, but also contain slimb. Likewise, although the data are not 
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included herein, it was shown that slimb immunocomplexes contain Skp'l and Cull. ^ 
To examine whether slimb/Skpl complexes are capable of associating with 
phosphorylated IkB, 293 T-cells were transfected with CMV-HA slimb and CMV-HA 
Skpl and lysates subsequently incubated with IkB or phosphorylated IkB beads prior 
to SDS-PAGE and Western analysis with anti-HA antibodies as shown in Figure 1 IB. 
In this Figure, lysates from the indicated transfected cells were subjected to binding 
reactions using immobilized IkB or phospho-lKB. After washing, bound proteins were 
subjected to immunoblotting with anti-HA to visualize slimb and Skpl proteins. As 
shown, Skpl and slimb associate specifically with phospho-lKB (i.e., transfected slimb 
and Skpl assemble into complexes that are recognized by phospho-lKB). 

HA slimb was found to associate with phosphorylated but not unphosphoryiated 
IkB beads with or without transfection of Skpl. HASkpl also associated with IkB in 
a phosphorylation-specific manner. Upon longer exposure of this blot. HASkpl was 
detectable in complexes with phosphorylated (but not unphosphoryiated) IkB in lysates 
from cells transfected with HA Skpl alone, suggesting that HA Skpl can assemble 
with the endogenous slimb protein. - Previous studies indicated that a peptide 
containing the sequence KKERLLDDRHDSGLDSMKDEE (residues 21-41 from IkB; 
SEQ ID NO:60) will not inhibit IkB ubiquitination when added in vitro to a crude cell 
lysate which supports IkB ubiquitination in a manner that is dependent upon the 
phosphorylation of Ser-32 and Ser-36 in IkB. In contrast, the same peptide that has 
been phosphorylated on Ser-32 and Ser-36 will block the ubiquitination of IkB. 
Similarly, phosphorylated IkB peptide will block nuclear translocation of NF-kB in 
intact cells in response to stimuli while the unphosphoryiated peptide will not. It is 
known that IkB needs to be phosphorylated on these two serines by IKK for 
ubiquitination to occur and this phosphorylation serves as the signal. These 
phosphopeptides derived from IkB are thought to block IkB ubiquitination by 
competing with the full-length IkB substrate for the recognition factor of the ubiquitin 
ligase that is normally functioning in IkB ubiquitination. Thus, the finding of the 
present invention that this same phosphorylated IkB peptide, but not the 
unphosphoryiated peptide, will specifically interact with the SCF slimb complex 
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suggests that this slimb complex is the ubiquitin ligase for IkB. Thus, the present 
invention provides a novel E3 ubiquitin ligase complex, thereby providing means to 
identify therapeutic targets for regulating NF-kB activity, to identify the molecular 
determinants that confer the ability of this ligase to recognize phosphorylated IkB ; and 
to identify molecules that can disrupt this interaction. 

These studies have revealed that slimb recognizes the phosphorylated targeting 
signal in IkB. It is contemplated that other cellular or viral proteins contain these 
sequences and will be therefore targeted to the slimb ubiquitin ligase. Although this 
sequence is recognized by slimb, it is further contemplated that other unrelated 
sequences may also interact with slimb possibly through independent domains. It is 
also contemplated that other F-box proteins containing analogous mutations will find 
use to demonstrate the specificity of the dominant negative effect. 

E. Further characterization 

Using the methods and compositions of the present invention, there are several .. 
approaches available to further characterize the relationship between the SCF slimb 
complex and IkB. These include both in vivo and in vitro approaches. 

In vivo: In one embodiment of the. present invention, NF-kB activation or IkB 
destruction is blocked using a dominant negative form of slimb. A dominant negative 
form of slimb is one that will still bind to IkB but will not assemble with the 
cull/Skpl complex. Therefore, the dominant negative slimb protein, when expressed 
at sufficient levels in transfected cells, would bind phosphorylated IkB. thereby 
blocking access of the endogenous slimb protein to IkB. Since this dominant negative 
IkB is not assembled with cull/Skpl complexes, appropriate E2 conjugating enzymes 
would not be physically coupled to IkB and would therefore not carry out the 
ubiquitination reaction. Many forms of slimb find use as dominant negative proteins 
and are made using methods standard in the art. For example, in preferred 
embodiments, versions of slimb that either lack the F-box domain or contain one or 
_mo.re_.point mutations in the F-box domain arejused. This domain^ is required for 
interaction with SkpL and mutation of the F-box in the appropriate residues blocks 
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association with Skpl. The preferred residues to be useful in this regard include those m 
that are highly conserved in other F-boxes. Association with Skpl in vitro could be 
used to demonstrate that the mutant slimb protein no longer interacts with Skpl. The 
function of the slimb dominant negative protein is assessed, for example, by 
monitoring NF-kB activity on a reporter construct the translocation of NF-kB to the 
nucleus in response to TNF treatment, or stabilization of IkB protein levels. 

In vitro: In one embodiment of the present invention, the rate of IkB 
ubiquitination in cells is directly altered by blocking or activating slimb function. For 
example, in one series of experiments, the ubiquitination of IkB is blocked using slimb 
mutants (i.e., dominant negative F-box mutants) that bind IkB but not Skpl, thereby 
uncoupling IkB's ability to associate with endogenous SCF slimb when the mutant is 
overexpression. A set of conserved residues in the F-box whose mutation abolishes 
interaction of the F-box protein Cdc4 with Skpl has previously been identified (Bai et 
a/., Cell 86:263 [1996]).. In one set of experiments, two sets of conserved F-box 
residues (LP and IL) in slimb are mutated to AA and act to verify binding to phospho- 
IkB but not Skpl in vitro. Appropriate. mutants are transfected into HeLa cells and the 
effects on TNF-induced activation of NF-kB is assessed using three primary assays: 1) 
pulse chase analysis of IkB (when a high level of transfection is achieved), 2) NF-kB 
activated reporter (e.g., luciferase) activity, and/or 3) entry of Re! into the nucleus by 
immunofluorescence. Other F-box proteins (including the WD40 containing MD6), 
mutant in the F-box. are used as controls. 

The results of experiments conducted during the development of the present 
invention indicate that slimb levels are low compared to, the levels of transfected 
slimb. Thus, in preferred embodiments of the present invention, the dominant 
approach is used. For confirmation or as alternative embodiments, other approaches 
such as antisense are used. For example, the antisense approach has been used to 
successfully block IKK activity (DiDonato et al., [1997], supra). 
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In yet other embodiments of the present invention, the role of slimb in the 
ubiquitination of IkB is characterized. In one embodiment, overexpression of SCF 
slimb components is used to enhance the unstimulated rate of IkB ubiquitination. 

In other embodiments, the activity of the SCF slimb complex toward IkB is 
demonstrated in vitro. For example, experiments conducted during the development of 
the present invention have demonstrated that cells can be transfected with slimb, cull, 
and Skpl to generate complexes. In the in viiro embodiments, experiments are 
conducted to examine whether slimb alone or in combination with Skpl and Cull 
accelerates ubiquitination of endogenous or co-transfected IkB, using pulse chase 
analysis or direct ubiquitination assays. Overexpression of IkB increases its levels 
such that the endogenous slimb complex does not efficiently ubiquitinate it thereby 
providing a window for acceleration by exogenous slimb. Direct ubiquitination 
analysis is achieved by co-transfection of a tagged ubiquitin plasmid followed by 
immunoprecipitation of IkB and immunoblotting for the tagged ubiquitin. 

In yet other embodiments of the present invention, methods to determine 
whether slimb transfection can forceNF-KB activation in the absence of stimulation as 
a result of residual IKK activity, or with reduced levels of stimuli are conducted. 
Controls include slimb mutants that cannot bind IkB, and F-box proteins that do not 
associate with IkB (as described above). 

As described above, the present invention provides approaches for 
reconstruction the SCFCdc4 ubiquitin ligase pathway for the Cdk inhibitor Sicl (See 
also, Skowyra ef a/.. Cell 91:209 [1997]). This approach also finds use with slimb. 
First, in some embodiments, experiments are conducted to determine whether slimb 
immune, complexes from transfected ceils contain IkB ubiquitin ligase activity using 
phosphorylated IkB or IkB point mutants in phosphorylation sites as substrates. Cdc34 
is the most likely candidate for the E2, however other E2 are also tested (e.g., Ubc4. 
5, and 10). The development of this system provides a screening assay to. examine 
whether particular molecules function to block IkB ubiquitination. 
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In yet other embodiments of the present invention, methods are provided to 
determine whether interference with slimb inhibits NF-kB function and induces 
apoptosis. There is clear evidence that blocking NF-kB action in mammary tumor 
cells (578T) (Sovak et al., supra) and in other cell types (Van Antwerp et al., supra: 
Wang et al., Science 274:784 [1996]; and Wu et al., supra) can lead to apoptosis. In 
one set of experiments, synthetic peptides overlapping the IkB recognition sequence (in 
either the phosphorylated or unphosphorylated forms) are generated and microinjected 
(as described by Connell-Crowley et al., Curr. Biol. 8:65 [1997]) into 578T human 
mammary tumor cells. Apoptosis, as well as the fate of NF-kB/IkB. is assessed by 
TUNEL and immunofluorescence, respectively, using standard methods. In other 
embodiments methods are provided to test whether dominant negative slimb or 
appropriate controls will induce apoptosis. 

F. slimb/UB interaction surfaces as a target for drug design 
Due to the widespread interest in the generation of anti-NF-icB therapeutics, 
many steps in the NF-kB pathway are being targeted. Because the nature of the 
ubiquitin ligase for IkB was unknown in the art. prior to the present invention, this 
step had not yet been explored. The slimb complex of the present invention provides 
a novel target and provide means to identify anti-NF-KB therapeutics. For example, 
one major advantage of slimb is that it recognizes a small phosphopeptide sequence. It 
is contemplated that molecules that mimic this phosphopeptide and block NF-kB 
activation wili be identified using the method of the present invention. 

In some embodiments, the first steps in generating slimb/lKB interaction 
-surfaces involve identification of the molecular interaction surfaces (interacting motifs) 
between slimb and IkB, and identification of peptides or proteins that, by virtue of 
binding to slimb, block binding to IkB. These steps identify and provide motifs and 
assays that find use in screening combinatorial libraries for small molecule inhibitors 
of the interaction. As there are many alternative approaches that could be taken to 
identify molecular interaction surfaces, it is not intended that the present invention be 
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limited to any specific approach. Preferred approaches are illustrated below, although m 
the present invention is not limited to these particular approaches. 

In a first embodiment of the present invention, a modified version of the 
reverse two-hybrid approach (See e.g., Vidal et aL, Proc. Natl. Acad. Sci. 93:10315 

5 [1996]) is applied to identify point mutants in slimb that abolish IkB binding. This 

approach uses the power of genetics to screen a large library of point mutants in slimb 
(e.g., generated by either chemical mutagenesis or PCR using standard methods) to 
identify those that have lost the ability to bind to a target. Slimb mutants that fail to 
interact with IkB will be counter-screened for interaction with Skpl and for expression 

10 of full-leneth slimb mutant protein using methods similar to those illustrated in Figure 

12. In this Figure, phosphorylation-specific interaction of SCF slimb complexes with 
IkB peptide sequences were analyzed. Ly sates (1 mg) prepared as described above for 
Figure 10 were incubated with 10 microliters of affigel beads containing either the IkB 
peptide or the same peptide containing phosphoserine at both serine residues. Beads 

15 were washed three times with lysis buffer and bound proteins separated by SDS- 

PAGE. Proteins were transferred -to nitrocellulose and -used for immunoblotting with 
the indicated antibodies. A subset'of "slimb mutants (determined by sequencing) that 
pass the secondary tests likely reside in IkB contact sequences. 

Preferred interaction surfaces for use in screening assays are those that have 

20 clustered mutations (e.g., those that are localized nearby on the same surface). To 

ensure that mutations reflect an interaction site as opposed to structural alterations, the 
samples are assayed for second-site revertants in IkB that regenerate interaction with a 
mutant slimb protein. These studies, together with conventional deletion analysis 
provide information about the necessary and sufficient sequences in slimb. - 

25 In other embodiments of the present invention, consensus sequences "are 

determined for interaction with the phosphopeptide binding site(s) in slimb. The small 
size and simplicity of the IkB sequence makes it an attractive candidate for 
determining a consensus binding sequence. In one embodiment of the present 
invention, a peptide library approach is used to identify consensus sequences for 

"30 ~ phosphopeptid^recogniti^ _ Tl^tecfeiqu^"(5'ee e.g. ? Sor^yang~e/"a/— eell-72:767- 

- 77 - 



WO 99/18989 



POTUS98/21763 



[1993]; Songyang et at... Mot. Cell. Biol. 14:2777 [1994]; and Songyang et aL, J. Bio]. 
Chem. 270:14863 [1995]) involves applying a highly complex mixture of peptide 
sequences thai contain phosphoserines three residues apart (as in IkB), but are 
otherwise degenerate, to immobilized slimb or the minimal interaction domain 
identified above. After the column is washed, peptides are eluted and sequenced to 
determine consensus sequences. Individual sequences are then tested for binding. The 
goal here is to define how selective the interaction site is. It is known for instance 
that the spacing between the phosphoserines is required for IkB to be destroyed (Yaron 
et al. 9 supra). The elucidation of such a consensus provides a theoretical "sequence 
space" and a starting point for drug discovery. It is also contemplated that this motif 
will find use to search databases for other potential slimb substrates and/or regulators. 
In alternate embodiments, a particular peptide sequence in the context of two glutamic 
acids (which can mimic phosphoserine) may be able to associate with slimb. In yet 
other embodiments, peptide library experiments are performed with fixed glutamic 
acids to determine if any sequences exist that compete in a phosphorylation- 
independent manner. This provides a starting point fornon-phosphorylatable slimb 
inhibitors. To determine whether the peptides identified inactivate slimb in cells, 
peptides are microinjected into tissue culture cells and NF-kB function, as well as 
apoptosis, in 578T cells is determined (as described above). 

In other embodiments of the present invention, an alternative approach to the 
directed search for competitive binding components is used, which combines the 
complex nature of the human genome or peptide aptomer libraries coupled with the 
power of the reverse twojiybrid approach. In this embodiment, cDNA or peptide 
aptomer libraries are be transformed into yeast strains expressing GAL4-IkB and ACT- 
slimb and cells are selected for the loss of the iKB/sIimb interaction. Library plasmids 
are rescued and sequenced to identify binding components, with further analysis 
revealing whether these proteins/peptides disrupt the interaction by binding to one or 
both of the proteins. Peptide aptomers are then assessed as described above for 
synthetic peptides. 
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From these illustrative examples, it is clear that the present invention provides ^ 
means to develop anti-NF-KB therapies based on the blocking of IkB ubiquitination. 
More generally, the identification and characterization of siimb as a member of an 
SCF complex illustrates that the methods and compositions of the present invention are 
5 capable of identifying and isolating F-box proteins and detecting F-box protein targets 

and F-box protein complexes. 

EXPERIMENTAL 

The following examples are provided in order to demonstrate and further 
illustrate certain preferred embodiments and aspects of the present invention, and are 

10 not to be construed as limiting the scope thereof. 

In the experimental disclosure which follows, the following abbreviations 
apply: h (human); Sc {Saccharomyces cerevisiae); m (mouse); Ub (ubiquitin): El (Ub 
activating enzyme); E2 (Ub carrier protein); E3 (Ub-protein ligase): °C (degrees 
Centigrade); rpm (revolutions per minute); BSA (bovine serum albumin): CFA 

15 (complete Freund's adjuvant); IF A (incomplete Freund'-s- adjuvant): IgG 

(immunoglobulin G); IM (intramuscular); IP (intraperitoneal); IV (intravenous or 
intravascular); Sc (subcutaneous); H 2 0 (water); HC1 (hydrochloric acid): aa (amino 
acid); bp (base pair); kb (kilobase pair); kd (kilodaltons); gm (grams); ug 
(micrograms); mg (milligrams); ng (nanograms); \x\ (microliters); ml (milliliters); mm 

20 (millimeters); nm (nanometers); j.im (micrometers); M (molar); mM (millimolar); MW 

(molecular weight): sec(s) (second/seconds); min(s) (minute/minutes): hr(s) 
(hour/hours); MgCl 2 (magnesium chloride); NaCl (sodium chloride); DTT 
(dithiothreitol):'OD 2S0 (optical density at 280 nm); OD 600 (optical density at 600 nm); 
PAGE (polyacrylamide gel electrophoresis); PBS (phosphate buffered saline [150 mM 

25 NaCL 10 mM sodium phosphate buffer. pH 7.2]); PEG (polyethylene glycol); PMSF 

(phenylmethylsulfonyl fluoride); SDS (sodium dodecyl sulfate); SDS-PAGE (sodium 
dodecyl sulfate polyacrylamide gel electrophoresis); LMA (low melting temperature 
agarose gel; Tris (tris(hydroxymethyl)aminomethane); NETN (20 mM Tris-HCl, pH 8, 
100 mM~NadrrmM~EDfA ; 0.5% NP-40, 5~mM NaF7 3~0~mM p~- T 
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nitrophenylphosphate, 1 jig/ml each leupeptin and antipain, and 1 mM PMSF); TBST, 
(20 mM Tris (pH 8), 100 mM NaCl, 0.5%Tween-20); IPTG (isopropyl-P-D- 
thiogalactopyranoside); LB (Luria-Bertani medium; per liter: 10 g tryptone, 5 g yeast 
extract, 10 g NaCl, pH 7; sterilized by autoclaving for 20 minutes at 15 lbs/in 2 ); vol 
(volume); w/v (weight to volume); V/V (volume to volume); Amersham (Amersham 
Life Science, Inc., Arlington Heights, IL); ICN (ICN Pharmaceuticals, Inc., Costa 
Mesa, CA); Amicon (Amicon, Inc., Beverly, MA); ATCC (American Type Culture 
Collection, Rockville, MD); Becton Dickinson (Becton Dickinson Labware. Lincoln 
Park, NJ); BioRad (BioRad, Richmond, CA); Clontech (CLONTECH Laboratories, 
Palo Alto, CA); Difco (Difco Laboratories, Detroit, MI); GIBCO BRL or Gibco BRL 
(Life Technologies, Inc., Gaithersburg/ MD); Babco (Berkeley Antibody Company, 
Richmond, CA); Invitrogen (Invitrogen Corp., San Diego, CA); Kodak (Eastman 
Kodak Co., New Haven, CT); New England Biolabs (New England Biolabs. Inc.. 
Beverly, MA); Novagen (Novagen. Inc.. Madison, WI); Qiagen (Chatsworth. CA); 
Pharmacia (Pharmacia, Inc., Piscataway, NJ): Sigma (Sigma Chemical Co.. St. Louis, 
MO); Sorvall (Sorvall Instruments, a subsidiary of DuPont Co., Biotechnology 
Systems, Wilmington, DE); Stratagene (Stratagene Cloning Systems, La Jolla, CA); 
Whatman (Whatman LabSales, Hillsboro, OR); Bethyl Laboratories (Bethyl 
Laboratories, Montgomery, TX); and Zeiss (Carl Zeiss, Inc., Thornwood, NY). 

Unless otherwise indicated, all restriction enzymes were obtained from New 
England BioLabs and were used according to the manufacturer's instructions; all 
oligonucleotide primers, adapter and linkers were synthesized using standard 
methodologies on an ABI..DNA synthesizer. All chemicals were obtained from Sigma 
unless otherwise indicated. 
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EXAMPLE 1 
Preparation of Antibodies 

In this Example, anti-Skpl and ami-Sicl antibodies were prepared. Using 
standard methods as known in the art, anti-Skpl and anti-Sicl polyclonal antibodies 
were generated in rabbits, with bacterial Gst fusion protein described below, used as 
the antigen. 

A. Antigen Preparation 

Expression plasmids for GST-SKP1 and GST-SIC1 were generated by ligating 
open reading frames for the encoded proteins into pGEX2TK (Pharmacia), using 
established procedures known in the art (See e.g., J. Sambrook el al, supra). The 
Genbank accession numbers for SKP1 and SIC1 are U61764 and X78309, respectively. 

Plasmids were transformed into E. coli strain BL21 (DE3)(Novagen). For 
expression, 1 L of E. coli cells were grown in LB medium containing 0.1 mg/ml 
ampicillin at 37°C, until the OD 600 reached 0.8. Expression was induced with 400 mM 
IPTG for three hours. Cells were' harvested by centrifugation (2,000 xg for 10 
minutes), and then lysed in 70 ml NETN buffer for 30 minutes, on ice. The insoluble 
material was then removed by centrifugation (14,000 xg, for 20 minutes). The lysate 
was then incubated with 0.5 ml glutathione Sepharose (Pharmacia) for 1 hour at 4°C. 
The Sepharose beads were washed three times with 10 ml NETN buffer, and washed 
twice with 5 ml of 100 mM NaCl. and the protein was eluted with buffer containing 
0.5 nil 100 mM Tris (pH 7.5), 100 mM NaCl, 40 mM glutathione. The protein was 
then stored at -80 6 C, prior to its use in the affinity purification of antibodies. 



B. Antibody Production and Affinity Purification 

Polyclonal rabbit anti-Cdc34 and anti-Cdc4 sera (provided by M. Goebl), as 
well as anti-Sicl. were affinity purified using recombinant antigens immobilized on 
nitrocellulose. The anti-Skpl antibodies were not affinity purified. 

To affinity purify the anti-S icl antibodies, GST-Sicl protein (0.1 mg) was 

subjected to electrophoresis on a 12% polyacrylamide (SDS-PAGE) gel., the^rbtein 
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was blotted to nitrocellulose (3 hours, at 350 mA). Nitrocellulose filters containing . 
GST-Sicl protein were incubated with 1 ml of anti-Sicl antibodies for 3 hours, the 
filters were washed twice with 10 ml of buffer containing 50 mM Tris (pH 7.5), 50 
mM NaCl, 0.5% Tween-20, and then eluted with 1 ml of 100 mM" glycine (pH 2), and 
stored at 4°C until use. 

In addition to the anti-Skpl and anti-Sicl polyclonal rabbit antibodies generated 
in this Example, and the anti-Cdc34 and anti-Cdc4 polyclonal rabbit antibodies from 
Dr. Goebl, monoclonal antibodies were also used in the following Examples. These 
commercially available monoclonal antibodies were obtained from Babco (anti-HA, 
anti-Myc), Novagen (anti-T7 gene 10, [i.e., "G10"]), and Kodak (anti-Flag, M2). 



EXAMPLE 2 
Expression, Purification and Phosphorylation 
of Recombinant Proteins 

In this Example, recombinant proteins were expressed, purified and 
phosphorylated. In these experiments, insect cells and baculoviruses were used. 
Baculovirus expression vectors were generated in this Example using the vectors in 
combination with linearized BaculoGoId or AcMNPV wild-type DNA (Pharmingen). 
The viruses, their tags, and base vectors are listed in Table 1 . 

Cdc4AWD is a mutant version of Cdc4 that contains a stop codon at residue 
566. which removes the last three WD-40 repeats. Gst-Cdc2SHA (D154N). also 
referred to as n Gst-Cdc28HA(K-)," is a kinase-impaired form of Cdc28. In complexes 
with- either Clnl or Clb5, this kinase was found to exhibit <2% activity toward histone 
HI. 

For expression of His 6 Cdc34 and His 6 -Sich plasmids were transformed into 
BL21 (DE3) cells (Novagen). One liter of cells were grown in LB containing 0.1 
mg/ml ampicillin, at 37°C ? until an OD 600 of 0.8 was reached. Expression was then 
induced with 400 mM IPTG for three hours. Cells were harvested by centrifugation 
" X^OOO xg71^r0~rmm^ " 
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7.5) containing 500 mM NaCl , and O.i mg/ml lysozyme (Sigma), and incubated for ^ 
45 minutes on ice. Insoluble material was removed by centrifugationX14 ; 000 xg, for 
20 minutes). The lysate was then incubated with 0.5 ml NV 2 -NTA (Qiagen) resin as 
directed by the manufacturer. The protein was eluted with 20 mM sodium phosphate 
(pH 6) containing 500 mM NaCl and 200 mM imidazole, and stored at -80°C 



Table 1. Baculovirus Expression Vectors 



Virus 


Tag 


Base Vector 


Cakl 


None 


pVL 


Cdc4 


None 


pBBIII 


Cdc4AWD 


None 


pBBIII 


Cdc4 F 


C-terminal Flag 


pBBIII 


Cdc34 


None 


pBBIII 


Cdc53 M 


N-terminal Myc 


pBBIII 


Clb5 


• _ None 


pVL 


/-*<i i HA 

Clnl HA 


C-terminal HA 


nRRTTI 


Cln2 HA 


C-terminal HA 


pVL 


Gst-Cdc28 HA 


N-terminal Gst 
C-terminal HA 


pVL 


Gst-Cdc28 HA (D154N) 


N-terminal Gst 
C-terminal HA 


pVL 


. Grrl Gl ° 


N-terminal His 6 -G10 


pBBHis 


His 6 -Cksl 


N-terminal His 6 


pVL 


Sicl 


None 


pBBIII 


Skpl 


None 


pVL 


Skpl F 


N-terminal Flag 


pBBIII 


Gst-Skpl 


N-terminal Gst 


pVL 
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For recombinant protein expression and assembly of complexes. 4 x 10 5 insect ^ 
cells (Hi5, Invitrogen) were infected with the indicated virus combinations for 40 
hours. These combinations included baculoviruses expressing Myc-tagged Cdc53 
(Cdc53 M ), Cdc34, Cdc4, and Skpl. Cells were then harvested and disrupted in lysis 

•5. buffer (50 mM Tris-HCl (pH 7.5), 150 mM NaCI, 0.5% Nonidet P40 ? 10 mM NaF, 10 

mM p-glycerol phosphate, 1 mM PMSF, and 5 [ig/ml each leupeptin, antipain, and 
aprotinin). For isolation of protein complexes, typically about 3 ml of lysis buffer was 
used per 0.5 x 10 3 cells. 

To examine the assembly of recombinant yeast proteins, 0.4 ml lysate were 

10 typically derived from 2 x 10 6 cells. In both cases, cell lysates were centrifuged for 2 

minutes at 14..000 xg, prior to affinity- or immuno-purification. Immunopurification 
was performed by incubating the lysates at 4°C for 2 hours with 4 ug of the-anti-Myc 
or anti-GlO antibody and 8 ul of Protein A-Sepharose, or with 8 ul of immobilized 
. anti-Flag antibodies (Kodak; See, Example 1). Immune complexes were washed three 

15 times with 1 ml of lysis butter prior to SDS-PAGE. 

For SDS-PAGE, an equal volume of 2X sample- buffer (250 mM Tris (pH 6.8) ; 
4% SDS ; 20% glycerol, 10% 2-mercaptoethanol) was added to the samples to be 
tested, and boiled for 2 minutes. Samples were then electrophoresed in 12% 
polyacrylamide gels with 35 mA constant current. Proteins were transferred to 

20 nitrocellulose filters using a BioRad transfer apparatus in 50 mM Tris/glycine buffer 

(pH 8), containing 20% methanol, for three hours, at 350 mA. The nitrocellulose 
filters were then blocked with 5% non-fat dry milk solution for 1 hour, followed by 
incubation overnight with primary antibody. The antibody dilution used was. 1 : 1000 
for anti-Cdc4, anti-Cdc34 1 anti-gene 1 0, anti-Sicl, anti-myc, and anti-HA; the anti-Skpl 

25 antibody was diluted 1:4000. Blots were washed in TBST (20 mM Tris (pH 8), 100 

mM NaCI, 0.5% Tween-20) for 30 minutes, and then incubated with either goat anti- 
rabbit conjugated horseradish peroxidase (HRP) or rabbit anti-goat conjugated HRP 
(Promega), as appropriate, at a dilution of 1:25,000, for 30 minutes. Immunoblots 
.were then washed with TBST for 30 minutes, and developed using enhanced 

30 chemiluminescence detection (Amersham) as described by the supplier: ' 
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As shown in Figure 1A : in the presence of all four proteins (Cdc53 M ? Cdc34, . 
Cdc4, and Skpl), anti-Cdc53 M complexes contained Cdc4 5 Cdc34, and Skpl. 
However, in the absence of Skpl, only low levels of Cdc4 bound to Cdc53 M , 
regardless of the presence of Cdc34 (Figure 1A, lanes 7 and S). This result was 
confirmed through the analysis of Cdc53 M association with anti-Cdc4 immune 
complexes (See, Figure IB). Thus, Skpl was shown to facilitate association of Cdc53 
with Cdc4. In contrast, both Skpl and Cdc34 can simultaneously associate with 
Cdc53 M in the absence of other yeast proteins (See; Figures 1A and 1C). Together, 
these data indicated that Cdc34, Cdc53, Skpl, and Cdc4 form a multiprotein complex. 

A. Sicl/Clb5/Gst-Cdc28HA(K-) Complexes 

Sicl/Clb5/Gst-Cdc28HA(K-) complexes were purified from 4 x 10 s cells, as 
described by Connell-Crowley et al. (Connell-Crowley a ai. Mol. Biol. Cell., 8:287- 
301 [1997]). Briefly, eight T-150 flasks of insect cells (Highfive. Invitrogen) were 
infected with 1 ml each of baculoviruses expressing either GST-Cdc2SHA, ClnlHA, 
Cksl, and CakL or baculoviruses expressing Gst-Cdc28HA(K-), Clb5. and Sicl. After 
40 hours, the cells were lysed at ~4°C/in 6 ml of NETN (20 mM Tris-HCl, pH 8, 100 
mM NaCl, 1 mM EDTA, 0.5% NP-40, 5 mM NaF, 30 mM p-nitrophenylphosphate, 1 
\xglm\ each leupeptin and antipain, and 1 mM PMSF). Lysates were cleared by 
centrifugation at 14,000 xg for 10 minutes. Supernatants were rotated with 0.2 ml of 
GSH-Sepharose for 60 minutes at 4°C. and the beads were washed three times with 2 
ml of the lysis buffer, followed by two washes with 100 mM Tris (pH 8), 100 mM 
NaCl. Proteins were then eluted with 0.2 ml of 100 mM Tris (pH 8). 100 mM NaCL 
40 mM glutathione (Sigma), and 10% glycerol. The proteins were then stored at 
-80°C until use. 
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B. Gst-Cdc28HA/ClnHA/Cksl and 
Gst-Cdc23HA(K-)/CInHA7Cksl Complexes 

Gst-Gdc23HA/ClnHA/Cksl (i.e.., ,, ClnHA/Gst-Cdc28HA/Cksl n in the legend 
for Figure 2A) and kinase impaired Gst-Cdc28HA(K-)/ClnlHA/Cksl complexes were 
prepared as described above, as were cells co-infected with viruses expressing 
appropriate proteins, and CAK1 expressing virus generated from a cDNA generously 
provided by C. Mann (See, Thuret et aL, Cell 86:565-576 [1996]). The presence of 
Cksl and Cakl resulted in a 5-fold increase in the yield of active Cln/Cdc28 kinase 
complexes, as purified after insect cell co-infection (determined using histone HI as a 
substrate). Figure 2A shows an SDS-PAGE analysis of purified Cdc2SHA/CksL In 
this Figure, the asterisk indicates the position of endogenous GST protein. 

C. Phosphorylated Sicl Complexes 

Phosphorylated Sicl complexes were generated by incubating 2.5 uM 
Sicl/Clb5/Gst-Cdc28HA(K-) with Gst-Cdc28HA/CInlHA/Cksl (50 niM) and 1 mM 
ATP in kinase buffer (50 mMTris HCl .(pH 7.5), 50 mM NaCK 10 mM MgCl 2 ) for 45 
minutes at 25°C. Control unphosphorylated Sicl complexes were produced in an 
identical fashion by omitting Clnl kinase. Cln/Cdc28 autophosphorylation was 
performed by incubating 200 nM Cln/Cdc28 complexes with 1 mM ATP in kinase 
buffer at 25°C for 1 hour. To generate phosphorylated Sicl free of Cln/Cdc28 kinase, 
bacterial Sicl (0.5 |iM) was incubated with 2 mM ATP and Cln2/Gst-Cdc28/Cksl 
immobilized on GST-Sepharose (Pharmacia) for 60 minutes at 37°C. Forty ng of 
phosphorylated Sicl were removed from the beads for use in ubiquitination reactions, 
at a final concentration of 1 nM. For 33 P-labeling of Sicl and Clnl proieins. kinase 
reactions were performed at 25°C for 30 minutes, using 50 jxM (y-" 2 P ATP (0.3 
nCi/pmol)) followed by incubation with 1 mM unlabeled ATP for an additional 30 
minutes. 

Figure 2B shows the gel results of phosphorylation of Sicl by Clnl/Cdc28 
complexes in vitro. The result for Sicl/Clb5/Gst-Cdc28HA (K-) incubated with ATP 
are shown in lane L while the result for Clnl/Cdc28 and ATP is shown in lane 2~ 
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Lane 3 shows the reaction products obtained when Clnl/Cdc28 complexes alone were , 
incubated with y- 32 P ATP. In lanes 4 and 5, the results from experiments in which 
smaller amounts of Sicl phosphorylation reactions with 50 nM of Sicl were performed 
in the presence of y- 32 P ATP. 

D. Grrl Complexes 

The Grrl complexes were prepared by infecting one T-150 flask of insect cells 
as described above, with baculoviruses expressing GrrlGlO, SkpL and Cdc53 M , or 
variations thereof. Forty hours after infection, the cells were lysed in 3 ml of NETN, 
and the lysates cleared by centrifugation at 14 : 000 xg for 10 minutes. Ten percent of 
each lysate was used for immunoprecipitation with 5 jig of anti-gene 10 antibodies 
(Novagen), and 8 ul of protein A-Sepharose (4°C for 90 minutes). The immune 
complexes were washed three times with 1 ml NETN prior to use in binding 
experiments or ubiquitination reactions. 

The complexes were immunoprecipitated with either (A) a tVtyc tag on Cdc53 
(Cdc53 M ) using anti-Myc antibodies or <B_) a Flag tag on Cdc4 (Cdc4 F ) as described in 
Example 3. Immune complexes were immunoblotted and probed with anti-Myc to 
detect Cdc53 M . anti-Cdc4 ? anti-Cdc34, and anti-Skpl as described in Example 3 (See, 
Figure 1). 

EXAMPLE 3 
//; Vitro Binding Assays 
Binding reactions were performed at 4°C for 1 hour, in 100-250 ml mixtures 
containing appropriate immunopurified complexes prepared as described in Example 2, 
and affinity purified Sicl (20 nM) or Cln (2 nM) complexes. Associated proteins were 
then washed three times with 1 ml of lysis buffer prior to SDS-PAGE and 
irnmunoblotting, were performed as described above. 

In some experiments, 32 P-labeled Sicl or Cln complexes were employed at 
similar concentrations, and detected by autoradiography and phosphoimager analysis. 
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Based on protein staining with Coomassie Blue or silver, the quantities of proteins in . 
anti-Skpl F immune complex from Skpl F /Cdc53 M /Cdc4 expression cells was estimated 
to be: Skpl F (1 ug), Cdc53 M (200 ng), and Cdc4 (200 ng). Likewise, the levels of 
proteins in the anti-GrrlGlO complex were: GrrlGlO (100 ng), Cdc53 M (40 ng), and 
Skpl (20 ng). 

In additional experiments, insect cells were co-infected with constant quantities 
of baculovirus expressing Skpl F and increasing quantities of baculoviruses expressing 
either Cdc4, or a" C-terminal truncated form of Cdc4 lacking the last three WD-40 
repeats (i.e., Cdc4A\VD; lanes 12-17). Lysates were immunoprecipitated with anti- 
Flag antibodies to precipitate Skpl F complexes. Binding reactions with phosphorylated 
Sicl complexes and detection of bound protein were performed as described above. 

Figure 2C indicates that phosphorylation of Sicl -is- required "for its association 
with Cdc34/Cdc53/Skpl/Cdc4 complexes. As shown in this Figure, phosphorylated 
Sicl efficiently associates with Cdc53/Skpl/Cdc4 complexes, and this association is 
dependent upon the presence of Skpl (See, Figure 2C, lanes 6 and 8). Typically, 10- 
20% of the input phosphorylated Sicl was_ bound at about 20 nM Sicl. In contrast, 
the extent of binding of unphosphorylated Sicl (lane 7) was comparable to that 
observed in control immune complexes .generated from uninfected cells (lane 3) and 
was <\% of the input Sicl. Consistent with the results in Figure I , the level of Cdc4 
found in immune complexes lacking Skpl were > 10-fold lower than that found in the 
presence of Skpl. These data suggest that Cdc4 and/or Skpl function as binding 
factors for Sicl and that association of Sicl with this complex requires 
phosphorylation by Clnl/Cdc28. 

Figure 2D shows that association of phosphorylated Sicl with Cdc4 is 
enhanced by Skpl. In this Figure, lanes 3-9 contain anti-Flag immune complexes 
derived from cells infected with constant high quantities of a baculovirus expressing 
Cdc4 F , while lanes 4-10 contain increasing quantities of a baculovirus expressing Skpl 
in in vitro binding reactions with purified Clnl/Cdc28-phosphorylated Sicl. While 
Skpl alone did not interact with Sicl, it stimulated association of Sicl with Cdc4 by 
about 5-fold (Figure 2D). The weak association of Sicl with Cdc4 alone (Figure 2D, 
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lane 3) may reflect the participation of an insect cell Skpl homolog. The results 
described herein clearly demonstrate a positive contribution of Skpl in the Cdc4/Sicl 
interaction. 

Figure 2E shows thai association of phosphorylated Sicl with Skpl requires the 
5 WD-40 repeats of Cdc4. In this Figure, lanes 4-9 contain proteins obtained from 

insect cells co-infected with constant quantities of baculovirus expressing Skpl F , and 
increasing quantities of baculoviruses expressing Cdc4, while lanes 12-17 contain 
lysates from cells co-infected with constant quantities of baculovirus expressing Skpl F 
and increasing quantities of baculoviruses expressing a C-terminal truncated form of 
10 Cdc4 lacking the last three WD-40 repeats (/.*.. Cdc4AWD). Association of 

phosphorylated Sicl with anti-Skpl F immune complexes was absolutely dependent 
upon the presence of Cdc4 (See. Figure 2E, lanes 3 and 9). Moreover, deleting the 
last three WD-40 repeats from the C-terminus of Cdc4 abolished its ability to associate 
with phosphorylated Sicl (Figure 2E. lanes 10-16). Therefore, Cdc4 functions as the 
15 specificity factor for binding of phosphorylated Sicl and the Cdc4-Sic! interaction 

requires an intact WD-40 repeat domain in Cdc4. 

EXAMPLE 4 
Ubiquitination Assays 
In this Example, ubiquitination reactions were conducted. In these experiments, 
Ni 2 *-NTA resin was used to isolate ubiquitinated proteins from extracts of wild-type 
cells or sic\ deletion mutants expressing His^-Ub^ or Ub RA (Willems et aL [1996]. 
supra). In addition, once the strategy to generate Cdc4/Skpl/Cdc53 complexes that 
recognized phosphorylated Sicl was developed, experiments to determine whether 
these complexes can catalyze ubiquitination of Sicl in vitro when supplemented with 
Cdc34, El, ATP, and ubiquitin were conducted. 

In some experiments, bacterial Sicl was used and where indicated, was 
phosphorylated with soluble or immobilized Gst-Cdc28HA/Cln2HA prior to use. 
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Bacterial Sicl ubiquitination reactions employed 100 nM yeast El (a gift' from S. Sadis 
and D. Firley, Department of Cell Biology, Harvard Medical School). 

A. Ubiquitination of Sicl In Vivo 

To identify Sicl-ubiquitin conjugates in vivo, 200 ml (10 7 cells/ml) of wild-, 
type (MT235), or a sicl deletion (MT767) cells expressing either pCUPl-UBl*" 
(<pUB204>) or pCUPl-UBl" 1 ***™'** (<pUB223>) were prepared, and lysates were 
generated in 50 mM Tris-HCl (pH 7.5), 100 mM NaCl, 0.1% NP-40 : 1 mM PMSF, 
0.6 miM dimethylaminopurine, 1 fj-g/ml leupeptin, 1 ug/ml pepstatin, 10 ug/ml Tosyl- 
phenyl chloromethyl ketone and 10 jag/ml soybean trypsin inhibitor as described by 
Willems et aL (Willems a ai. Cell 86:453-463 [1996]). Briefly, 8 ug of yeast protein 
was incubated with 12 \x\ of N+ 2 NTA beads (Qiagen) for 1 hour at 4°C. as described 
by the manufacturer. The beads were then washed 3 times in lysis buffer, 1 time in 
high salt buffer (50 mM Tris-HCl, pH 8.0 r 0.5 M NaCl) ? and the proteins were eluted 
with 10 ul of 100 mM Tris-HCl. pH 6.8, 1% SDS, 100 uM DTT. 100 \xU EDTA. 
Proteins were separated by SDS'-PAGEand immunoblotted with anti-Sicl antibodies, 
as described above. 

Figure 3 shows that phosphorylated Sicl is ubiquitinated in vivo and in vitro 
with purified Cdc34 E2 and Cdc53/Skpl/Cdc4 complexes. In this Figure, the position 
of Sicl and Sicl-ubiquitin conjugates are indicated (i.e.. "Sicl'.' and "Sicl-Ub," 
respectively). 

B. Reconstitution of the Sicl Ubiquitination Pathway Using 
Recombinant Proteins 

Ubiquitination reactions contained immune complexes prepared from 2 x 10 6 
cells and equilibrated with ubiquitination buffer (100 mM Tris-HCl (pH 7.5), 5 mM 
MgCl : , 0.6 mM DTT), 500 nM bacterial Cdc34, 300 nM human El (a gift from M. 
Rolfe ; Mitotix), 2 mM ATP, and 7 mM yeast ubiquitin (Sigma) or Gst-UB 1 ^ (purified 
from bacteria expressing GEX-U6 RA [provided by M. Tyers. University of Toronto], 
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and the method described in Example 1 for GST-Skpl), and 80 ng of Sicl complexes g 
in a final volume of 14^1, excluding bead volume. The human El purification was 
described by Rolfe et ai (Rolfe .*/ gl., Proc. Natl. Acad. Sci. USA 92:3264-3268 
[1995]). Reactions were allowed to proceed at 25°C for 1 hour or as indicated, 
quenched with 2X sample buffer (250 mM Tris (pH 6.8), 4% SDS, 20% glycerol, and 
10% 2-mercaptoethanol), and analyzed by SDS-PAGE and immunoblotting with anti- 
Sicl antibodies as described above. 

The results indicated that in the presence of all reaction components, 
phosphoryiated Sicl was efficiently converted to higher molecular weight conjugates 
detectable with anti-Sicl antibodies (See, Figure 3B, lane 6; and Figure 3C ? lane 5). 
In contrast/ unphosphorylated Sicl -was not detectably ubiquitinated. Sicl 
ubiquitination absolutely required Cdc34. Cdc4, Cdc53, SkpL El and ubiquitin (See 
e.g, Figure 3B and Figure 3C). The pattern of high molecular weight Sicl conjugates 
obtained in reactions with ubiquitin was different from that observed when Gst-Ub 1 ^ 
was used &s the ubiquitination source as shown in lanes 5 and 1 1 of Figure 3C. These 
results confirm that the high molecular weight forms observed are products of 
ubiquitination. With Gst-Ub RA , the Sicl reaction products were integrated into a 
ladder of bands differing by approximately 35 kDa, the size of Gst-Ub^ (See, Figure 
3C lane 11). 

The ubiquitination reaction was time dependent and the reaction efficiency 
ranged from 10-40% of the input Sicl protein (Figures 3B and 3C). When the 
reaction was performed with pre-bound Sicl, the efficiency was greater than 50%. In 
addition, greater than 50% of the Sicl ubiquitin conjugates formed after 60 minutes 
were found to have dissociated from the Cdc4/Skpl/Cdc53 complex. In addition, 
neither Gst-Cdc28. Clb5, Cdc53, Skpl, or .Cdc4 formed ubiquitin conjugates under the 
reaction conditions employed, although Cdc34 was ubiquitinated as previously reported 
(Haas et aL J. Biol. Chem. ? 266:5104-5112 [1991]). 
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C. Ubiquitination of Sicl in Association with CIb5-Cdc28 Complexes 
To test whether Sicl ubiquitination requires association with Clb5/Cdc28 
complexes, ubiquitination reactions using Sicl produced in bacteria, with or without 
phosphorylation with Cln2/Cdc28 were performed as described above, with yeast E2 
5 replacing human EL To verify the absence of Cln2HA/Cdc28HA in the ubiqutination 

reaction, Sicl proteins were also irnmunoblotted with anti-HA antibodies. 

The results shown in Figure 3D indicate that ubiquitination of Sicl does not 
require that Cln/Cdc28 be present in the ubiquitination reaction, nor that Sicl be 
associated with Clb5/Cdc28. In this Figure, lane 1 contains Sicl purified from 
10 bacteria, while lane 2 contains Sicl treated with soluble CIn2/Gst-Cdc28 ? and lane 3 

contains immobilized Cln2/Gst-Cdc28. Use of phosphorylated Sicl that was free of 
* Cln2 kinase. is indicated by an asterisk. (lanes- 3 and 9). As in the case of Sicl 
assembled in insect cells with Clb5/Cdc28, phosphorylated Sicl from bacteria was 
efficiently ubiquitinated. with greater than 90% of the Sicl forming ubiquitin 
15 conjugates (lane 8) ; and ubiquitination absolutely required Sicl phosphorylation (lane 

Although phosphorylation of Sicl was required for its recognition by Cdc4 and 
Skpl, it remained possible that Cln/Cdc28, present in small amounts in the 
ubiquitination reaction, is also required for additional steps in the ubiquitination 

20 - process, for instance, to phosphorylate the ubiquitination machinery. To rule out this 

caveat, bacterial Sicl was treated with Cln2/Gst-Cdc28 complexes .immobilized on Gst- 
Sepharose beads, removed from the beads prior to use in ubiquitination reactions, and 
determined to be free of soluble kinase by immunoblotting with anti-HA antibodies. 
These results are shown in Figure 3D, lane 3. Sicl phosphorylated in this manner was 

25 also efficiently ubiquitinated (See, Figure 3D, lane 9). These data indicate that Sicl 

phosphorylation constitutes the primary requirement of Cln/Cdc28 kinases in Sicl 
ubiquitination in the in vitro reaction. 
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D. Clb5/Cdc28-phosphoryIated Sicl as a Substrate for Ubiquitinatiori 

In these experiments, it was found that Clb5/Cdc28-phosphoryiated Sicl was 
also a substrate for ubiquitination. In these experiments, constant amounts of Sicl 
were treated with increasing amounts of Clb5/Cdc28, until the kinase was in excess as 
determined by histone kinase assays. Under these conditions, Sicl electrophoretic 
mobility was reduced (Figure 3E, lanes 1-6, top), as determined by immunoblotting. 

Aliquots of differentially phosphorylated Sicl were used in ubiquitination 
reactions withimmunopurified Cdc53 M /Cdc4/Skpl complexes supplemented with 
Cdc34, El, ubiquitin, and ATP for 30 minutes, as described above (See. Figure 3E, 
lanes 1-6). As a negative control, partially phosphorylated Sicl corresponding to the 
Sicl protein in lane 5 (top) of Figure 3E, was reacted in the absence of Cdc34 (lane 7) 
or the Cdc53 M /Cdc4/Skpi complex (Figure 3E. lane 8). Sicl ubiquitination was 
determined by immunoblotting with anti-Sicl antibodies (Figure 3E, bottom). 

Although Sicl is an inhibitor of Cdc28/Clb5 complexes, when the kinase 
complex was in excess of Sicl, Sicl was phosphorylated as determined both by 
reduced electrophoretic mobility (See, Figure 3E) and A2 P incorporation. This result 
may explain the fact that' overexpression of CLD5 can drive S-phase entry in cln- cells, 
and suggests that active Clb5/Cdc28 formed during Sicl destruction may collaborate 
with Cln/Cdc28 to complete the Sicl ,ubiquitination process. 

E. Sicl Binding and Ubiquitination with Grrl 

In these experiments, the Cdc4 was substituted with another F-box protein 
(Grrl) in order to determine if this protein could support Sicl. binding and 
ubiquitination. Grrl has an F-box near its N-terminus and can interact simultaneously 
with Skpl and Cdc53 when co-expressed in insect cells (See e.g., Figure A). 

These experiments were conducted as described above, with the exception 
being that Grrl was substituted for Cdc4 (approximately 100 ng). Proteins were 
separated by SDS-PAGE, and blotted with anti-Skpl anti-Myc to detect GrrlGlO and 
Cdc53 N \ with anti-Skpl antibodies. 
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It was found that that Grrl and Cdc4 with Skpl /Cdc53 are mutually exclusive.^ 
In contrast with Cdc4, it was not possible to demonstrate enhancement of the 
Grrl/Cdc53 interaction in insect cells by co-expression of Skph even though Skpl 
assembled with these complexes. Importantly, Grrl assembled with Cdc53/Skpl 
complexes was unable to associate with phosphorylated Sic 1 , and was unable to 
support ubiquitination of phosphorylated Sicl complexes in the in vitro system with 
purified proteins under conditions where Cdc4 readily facilitated Sicl binding and 
ubiquitination (See, Figure 4B and 4C). Therefore, F-box proteins display selectivity 
toward particular targets. 

Figure 4A shows that Grrl can associate with Skpl and Cdc53 ? while Figure 
48 shows that phosphorylated Sicl associates with Cdc4 but not Girl-containing 
complexes. In this Figure, lanes 2o contain anti-Skpi F immune complexes derived 
from insect cells infected with the indicated baculovirus combinations were used for 
binding reactions with 32 P-labeled Sicl complexes. Ten percent of the input Sicl 
complex (lane 1) was included as a control. The presence of Cdc4, SkpL Cdc53, and 
Grrl was verified by immunoblotting. 

Figure 4C shows that Cdc4, but not Grrl, supports ubiquitination of Sicl in 
vitro. The indicated anti-Skpl F immune complexes were used in ubiquitination assays 
as described for Figure 3 (above) employing Gst-Ub^ as the ubiquitin source. 
Finally, Figure 4 shows the results verifying the presence of reaction components 
derived from immunoprecipitation (in this Figure, the blot used for ubiquitination 
assays was reprobed to detect GrrlGlO/ Cdc53 M ; and Cdc4). 



EXAMPLE 4 
Binding of Grrl to Clnl and Cln2 

In this Example, the binding of Grrl to Clnl and Cln2 was investigated. In 
particular, experiments were conducted in order to determine whether Grrl binds to 
Clnl and/or. Cln2 in a phosphorylation-dependent manner. Indeed, the finding that 
Sicl is recognized by the F-box protein Cdc4, together with a g^Siyie^iremehrfor 
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the F-box protein Grrl in Cln destruction, led to these experiments to examine whether 
Grrl functions in recognition of phosphorylated Clns. 

To generate Cln proteins for binding reactions, Cln/Gst-Cdc2S/Cks complexes 
were isolated from insect cells as described in Example 2B. In the presence of ATP, 
5 both Clnl and Cln2 are autophosphorylated, a modification that reduces their 

electrophoretic mobility (see below). To examine whether Grrl can associate with 
phosphorylated Clns and to compare the extent of selectivity of Grrl and Cdc4 toward 
Cln binding, anti-Skpl F . immune complexes from cells co-expressing Grrl or Cdc4 in 
the presence or absence of Cdc53 prepared as described above, were used in binding 

10 reactions with 32 P-labeled Clnl or Cln2 kinase complexes. 32 P-labeled Sicl was used as 

a control for Cdc4 binding. 

As shown in Figure 5 A. both Clnl and Cln2 complexes associated with 
GnT/Skpl K /Cdc53 complexes with an efficiency of about- 40% of the input Clnl or 
Cln2 (Figure 5 A, lanes 5 and 12), and this association did not require Cdc53 (Figure 

15 5 A, lane 16). In contrast, about 6% of the input Cln proteins associated with 

Cdc4/Skpl r complexes independent -of .the presence of Gdc53 (Figure 5A. lanes 7, 11, 
and 15) ; compared with 1% association in the absence of an F-box protein (Figure 5 A, 
lanes 6, 10, and 14). The extent of selectivity of these F-box proteins for Cln and 
Sicl is further reflected by the observation that Clnl protein present in the 

20 phosphorylated Sicl preparation was selectively enriched in Grrl complexes (Figure 

5A. lane 4). In this Figure, controls for the extent of binding (indicated by the 
asterisk) 1 were 20% of input Cln and 10% of input Sicl. The presence of all proteins 
in the binding reaction was confirmed by immunoblotiing (Figure 5B: in this Figure, 
complexes used for binding experiments in Figure 5 A were immunoblotted with the 

25 indicated antibodies to verify the presence of Cdc4, GrrlGlO, Cdc53 M . and Skpl F ), 

and the quantities of Cdc4 and Grrl were found to be comparable, based on 
Coomassie staining of SDS gels of immune complexes. Thus, Grrl and Cdc4 display 
specificity toward physiological substrates. 

Next, Grrl alone or in complexes with Skpl or Skpl/Cdc53 were 

30 immunoprecipitated from insect cell lysates and used in binding assays wuh 
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phosphorylated or unphosphorylated Clnl complexes prepared as described above. 
The results are shown in Figure 5C. As shown, unphosphorylated Clnl was produced 
in insect cells as a complex with kinase deficient Gst-Cdc28(K-), which minimized 
Clnl autophosphorylation during expression, and allowed the role of phosphorylation 
to be tested. Figure 5C, lane 1 shows that, as isolated, this Clnl protein migrates as a 
homogeneous species at approximately 66 kDa. In contrast, phosphorylated Clnl 
(Figure 5C, lane 2) undergoes a dramatic mobility shift to approximately 80 kDa, 
consistent with in vivo observations. Lanes 4-1 1 contain anti-GrrlGlO complexes 
derived from the indicated insect cell infections used in binding reactions with either 
unphosphorylated Clnl HA complexes generated using kinase impaired Gst-Cdc28(K- 
)HA (Figure 5C, lane 1) or phosphorylated ClnlHA/Gst-Cdc28HA complexes (Figure 
5C r lane 2). Anti-HA antibodies were used to detect Clnl HA and Gst-Cdc28HA. 
Twenty percent of the input Clnl HA complexes were run as controls (Figure 5C, lanes 
1 and 2). Clnl HA isolated from insect cells in complexes with active Cdc28 migrated 
as a series of modified forms, reflecting partial phosphorylation of Cln in vivo in insect 
cells (See, Figure 2A). Incubation of such ClnHA/Cdc28HA complexes with ATP 
quantitatively shifts Clnl HA to a single form migrating as an approximately 84. kDa - 
protein. The blot was reprobed to verify the presence of GrrlGlO, Cdc53 N \ and 
Skpl F . 

Phosphorylated Clnl (and its associated Cdc28 protein) efficiently associated 
with all Grrl complexes (Figure 5C. lanes 6, 8, and 10). but was absent from control 
binding reactions lacking Grrl . (Figure 5C, lane 4). In contrast, the levels of 
unphosphorylated Clnl associated with Grrl complexes were comparable to that found 
in binding reactions lacking Grrl (Figure 5C lanes 3. 5, 7, and 9). 

It was also determined that purified Skpl/Cdc53/Grrl complexes are not 
sufficient for Clnl ubiquitination by Cdc34 in vitro. Thus, association of both Clnl 
with Grrl and Sicl with Cdc4 was found to be greatly enhanced by phosphorylation. 
Anti-Skpl F immune complexes were purified from insect cells infected with the 
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indicated baculoviruses and supplemented with El, Cdc34, Gst-Ub**, ATP, and either , 
32 P-labeled Sicl or Clnl, as described above (e.g., Figure 3). 

As shown in Figure 5D, although the Grrl/Skpl/Cdc53 complex is capable of 
binding efficiently to phosphorylated Clnl, it was not competent for Clnl 
5 ubiquitination when supplemented with Cdc34 and EL Moreover, Figure 5D shows 

that Cdc4 complexes that functioned in Sicl ubiquitination also failed to catalyze 
ubiquitination of Clnl, despite the fact that Clnl can associate, albeit weakly, with 
Cdc4 (See, Figure 5 A). In contrast, identical preparations of phosphorylated Clnl 
protein were efficiently ubiquitinated in partially purified yeast lysates in a Cdc34 
10 dependent manner (Figure 5E), indicating that this preparation of Clnl is competent 

for ubiquitination. 



EXAMPLE 5 
Ubiquitination of Phosphorylated Clnl 

• In this Example, preparations- of. phosphorylated Clnl (as described above), 
15 were ubiquitinated in partially' purified yeast lysates in a Cdc34-dependent > manner. 

In these experiments, 0-100 jig YFII (a 250 mM NaCl eluate from a DEAE- 
cellulose column prepared exactly as described in Deshaies et ai [1995], supra) was 
supplemented with 500 nM Cdc34, 100 nM human El, ubiquitin, and an ATP 
regenerating system (2 mM ATP. 600 mM creatine phosphate, and 0.15 mg/ml 
20 creatine kinase). The ubiquitination reaction was initiated by addition of 20 ng 

ClnlHA/Gst-Cdc28HA/Cksl. After incubation for 60 minutes at 25°C, the. reactions 
were quenched and immunoblotted with anti-HA antibodies to detect Clnl HA and Gst- 
Cdc28HA. In Figure 5E, the protein indicated by an asterisk is a yeast protein in YFII 
that cross-reacts with the anti-HA antibodies used. As indicated in this Figure, this 
-25 preparation of Clnl is competent for ubiquitination. 
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EXAMPLE 6 
Identification of Human F-Box Proteins 

In this Example, new human F-box proteins were identified, using a two hybrid 
system. The SKP1 open reading frame (as an NdeVBamHl restriction fragment) was 
subcloned into pAS2 {See, Harper et ai, Cell 75:805-816 [1993]). pAS2-SKPl was 
transformed into yeast strain Y190, and this strain was then used in a two hybrid 
screen with a human breast cDNA library generated in A.ACTII as described by below. 

Yeast strain Y190 was deposited with the ATCC and assigned number (96400). 
Y190 was grown in YPD medium (10 g/1 yeast extract 20 g/1 peptone and 20 g/1 
dextrose) containing 10 mg/ml cycloheximide or on YPD plates (YPD medium 
containing 20 g/1 agar) containing 10 mg/ml cycloheximide. Y190 contains two 
chromosomally located reporter genes whose expression is regulated by GaI4. 

The first reporter gene is the E. coll lacZ gene which is under the control of 
the GAL1 promoter. The second reporter gene is the selectable HIS3 gene which 
encodes the enzyme imidazole glycerol phosphate (IGP) dehydrogenase. Yeast cells 
which express the HIS3 gene product can be selected by-theirability to grow in 
medium lacking histidine {He., SC-his medium). The X ACTII phage cloning vector 
was deposited with the ATCC and assigned number 87006. This a. ACTII phage 
cloning vector was deposited as a lysogen in JM107 cells which are grown in LB 
containing 50 |-ig/ml ampicillin. 

Yeast cells (strain Y190) containing specific nutritional markers were grown on 
SC medium lacking one or more amino acids. SC medium lacking a particular amino 
acid is referred to as dropout media. SC medium is made using the following 
components: 10X YNB (67 g yeast nitrogen base without amino acids in 1 liter water, 
filter-sterilized and stored in the dark). 
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Dropout mixture components: 



adenine 

aspartic acid 

leucine 

methionine 

threonine 

tyrosine 



800 mg 
4000 mg 
2400 mg 
800 mg 
8000 mg 
1200 mg 



argimne 
histidine 
lysine 

phenylalanine 

tryptophan 

uracil 



800 mg 
800 mg 
1200 mg 
2000 mg 
800 mg 
800 mg 



To make a dropout mixture, the above components are weighed out; leaving 
out the amino acids to be selected for, combined, and ground into a fine powder using 
a mortar and pestle. 

SC-Trp plates comprise per liter: 870 mg dropout mixture (minus tryptophan), 
20 g dextrose, 1 ml IN NaOH. 20 c agar, water to 900 ml. The mixture is then 
autoclaved. After autoclaving. 100 ml 10X YNB is. added just prior to pouring the 
plates. 

The bacterial strain used was E. coll strain BNN132 (ATCC 47059). These 
ceils were grown in LB (10 g/I bacto-tryptone [DIFCO], 5 g/1 bacto-yeast extract 
[DIFCO], 10 g/1 NaCL pH adjusted to 7.0 with NaOH)/ E. cpli strain BL2i(DE3) 
(Invitrogen) was grown in LB. 

As described in more detail below, the pAS2/Skpl/Y190 strain was transformed 
with 0.05 mg of plasmid library and 5 mg of carrier total yeast RNA, and 
transformants were plated on a minimal media. lacking histidine. leucine, and 
tryptophan, but containing 25 mM 3-aminotriazole. After 5 days at 30°C, plasmids 
were recovered from p-uaiactosidase positive colonies (See, Harper ef at. Cell 75:805- 



Also as described, in more_ detail below, sequencing of cDNA inserts from 
positive plasmids revealed the presence of one cDNA containing significant sequence 
identity to Cdc4 in the F-box domain of Cde4. This cDNA is referred to as "F3 
gamma." Other F-box containing cDNAs were identified by searching the EST 
(expressed sequence tag) database, with the F-box region from F3 gamma. As novel 
F-bo x containing proteins were identified these were used to further search the EST 
database, in order to identify other novel F-box proteins. For some of these, both the 



816 [1993]). 
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human and mouse homologs were identified. It is contemplated that these new F-box * 
proteins act as components of E3 complexes in mammalian cells (i.e., analogous to 
Cdc4 in budding yeast). Table 2 below lists the protein sequences identified in these 
experiments, while Table 3 provides the corresponding DNA sequences. Figure 7 
provides the alignments of these F-box 'proteins, with gaps indicated by dashes. Table 
4 provides longer (i.e., more complete) cDNA and amino acid sequences for some of 
the F-box proteins identified in the preliminary experiments. The sequences included 
in Table 4 contain at least a large portion of the open reading frames (ORFs), and 
contain potential target binding domains. Both F 1 and F2 contain leucine rich repeats 
(e.g., similar to Grrl). Thus, the present invention provides numerous sequences 
suitable for detection and identification of additional F-box proteins, as well as targets 
for intervention in the proteolysis pathways (e.g., for drugs and other compounds 
suitable for use to either enhance or reduce the efficiency and/or function of the F- 
box). 

A. Generation of. Human Breast Tissue cDNA Library in pACTII 

In order to facilitate the isolation of F-box gene sequences using the yeast two- 
hybrid system, an human breast tissue cDNA expression library was constructed in the 
X ACTII phage cloning vector. This cloning vector allows for the construction of 
cDNA libraries fused to sequences encoding the Gal4 transcriptional activation domain. 
The phage can be converted to a plasmid form (pACTII) as described below. 

An human breast tissue cDNA library was constructed using X ACTII as 
follows. Total RNA from breast tissue of an adult female obtained from reductive 
rnammoplasty was provided by Dr. Anne Bowcock (University of Texas Southwestern 
Medical Center). PolyA+ mRNA was produced using an mRNA isolation system 
(GIBCO-BRL). cDNA synthesis was accomplished using a directional cDNA 
synthesis kit from Stratagene as described by the manufacturer. 

After the synthesis of the second strand, the cDNA (in a volume of 400 was 
— spermine -preciaitate,d_by_jhe a ddition o f 22 p.1 of 100 mM spermine. The mixture was 
incubated on ice for 30 min and then pelleted by centrifugation in a microcentrifuge 
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(Eppendorf) for 15 min at 4°C. The cDNA pellet was washed three times for 30 
min/wash with 1 ml of spermine wash buffer (70% ethanol, 10 mM Mg (Ac) 2 , 0.3 M 
NaAc at pH 7) and once with 1 ml of 70% ethanol. The cDNA was then dissolved in 
50 |al of TE buffer (10 mM Tris-HCl, pH 8.0, 1 mM EDTA). 

"The ends of the cDNA were made blunt by treatment with T4 DNA polymerase 
using conditions recommended by the manufacturer (Stratagene). Following treatment 
with T4 DNA polymerase, 5 \i\ of 0.5 M EDTA was added and the mixture was 
extracted with phenol/chloroform and precipitated with ethanol. 

The- precipitated cDNA, approximately 4 jag, was resuspended in 7 jal of TE 
buffer and then ligated to 2 jag of a kinased adapter oligonucleotide in a total volume 
of 10 ul at 4°C overnight (12-18 hr). The hybridized oligonucleotide pair contained 
an EcoKL overhang. For example, the oligonucleotide CGCGCG hybridized with 
AATTCGCGCG (SEQ ID NO:59) will create a suitable EcoRl linker. 

Following the ligation reaction, 170 ul of TE buffer, 20 of 1 M KC1 and 10 
of 100 mM spermine were added. The mixture was incubated on ice for 30 min 
and precipitated and washed as described above. The adapted cDNA was resuspended 
in 20 \i\ of TE buffer and digested with Xhol prior to electrophoresis on a 1% LMA 
gel. cDNA having a length of 600 bp or longer was excised from the gel and purified 
using standard techniques prior to ligation into X ACTII arms. 

cDNA (0.1 j-ig) was ligated with 2 jug of a. ACTII plasmid DNA prepared as 
follows. One hundred micograms of X ACTII plasmid DNA was digested with Xhol 
and EcoRl; the digestion products were then precipitated with ethanol. briefly dried 
and then resuspended in J 90 ul of TE buffer. Ten microliters of 10 mM spermine 
were added to the side of the tube and the contents were mixed by rapid inversion of 
the tube. An immediate and obvious precipitate formed and was pelleted by 
centrifugation for 2 sec in a tabletop microcentrifuge. The pellet was then washed 
with spermine wash buffer followed by a wash with 70% ethanol as described above. 
The washed pellet was resuspended in 100 jj.1 of TE buffer. This preparation of X 
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ACTII plasmid DNA was then used for ligation into the cDNA containing adapters - 
(prepared as described above). 

The ligation of the adapted cDNA and digested X ACTII plasmid DNA was 
performed in a volume of 4 jal at 4°C overnight. The ligation mixture was packaged 
using one Gigapack Gold packaging extract (Stratagene) according to the 
manufacturer's instructions. Approximately 1 x 10 s total recombinants were obtained. 
The phage library was amplified on the LE392 strain of E. coli (Stratagene). 

Automatic subcloning conversion of the cDNA library in X ACTII into 
plasmid (pACTII) was accomplished by the incubation of 10 9 phage particles with 2 
ml of a fresh overnight culture of E. coli strain BNN132 (ATCC 47059) in 10 mM 
MgCl 2 for 30 min at 30°C without shaking. Two milliliters of LB (10g/l bacto- 
tryptone, 5g/l bacto-yeast extract, 10 g/1 NaCL pH adjusted to 7.0 with. NaOH) was 
then added and the cells were incubated with shaking for 1 hr at 30°C. The cells were 
then plated on ten 150 mm LB plates (15 g/1 bacto-agar in LB) containing 50 \±%lm\ 
ampicillin and incubated overnight at 37°C. ... 

Ampicillin-resistant cells were collected by scraping the plates; the cells were 
then added to 3 liters of terrific broth (12 g/1 bacto-tryptone, 24 g/1 bacto yeast extract 
and 100 ml/1 of a solution comprising 0.17 M KH 2 P0 4 , 0.72 M K 2 HP0 4 ) containing 50 
jj.g/ml ampicillin. The culture was grown to stationary phase and plasmid DNA was 
isolated using CsCl density gradients by standard methods (J. Sambrook et aL, supra, 
at pp. 1.33-1.48) 

B. Isolation of F-Box Sequences 

In this portion of the Example, an improved version of the yeast two-hybrid 
system was employed to identify proteins that bind to Skpl. The two-hybrid system 
employs genetic selection to allow the isolation of interacting proteins. The use of 
genetic, selection for the detection of interacting proteins allows much larger cDNA 
libraries to be screened for associating clones than could be accomplished using other 
techniques (i.e., screening expression libraries, such as kgtll, with labelled proteins). 
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The improved two-hybrid system employs the yeast strain Y190 as the recipient" 
cell line. The yeast strain Y190 contains two chromosomaliy located reporter genes 
whose expression is regulated by Gal4. The first reporter gene is the E. coli lacL 
gene, which is under the control of the GAL1 promoter. The second reporter gene is 
the selectable HIS3 gene. The two-hybrid system is improved by the use of an 
additional assay to eliminate false positives. "False positives" are defined as library 
clones that activate transcription in cells expressing fusions unrelated to the target 
protein {i.e., Skpl). To isolate interacting proteins, Y190 cells are first transformed 
with a first expression plasmid which encodes a fusion protein comprising a hybrid 
between the DNA-binding domain of the yeast transcription factor Gal4 (amino acids 
1-147) and a target protein {i.e., a protein which is used to identify proteins capable of 
interacting with this target protein). The transformed Y190 cells are next analyzed to 
determine the effect of the introduction of the first expression plasmid. If the 
transformation of Y 190 cells with the expression plasmid which expresses the target 
protein does not activate either the HIS3 or lacZ reporter genes, this transformed strain 
can now be used for screening an activation domain cDNA library. 

The activation domain library comprises plasmids capable of expressing the 
second hybrid molecules of the two-hybrid system. The second hybrids comprise 
fusion proteins containing the sequences encoding the Gal4 activation domain II 
(amino acids 768-881) fused to a cDNA library generated from human breast tissue 
(described above). When the Y190 cells transformed with the first expression plasmid 
are transformed with a second expression vector (from the expression library) capable 
of expressing a protein or portion of a protein which can bind to the Skpl hybrid, 
transcription of the His3 and lacZ genes is activated as the binding of the second 
hybrid brings the Gal4 activation domain II in close proximity to the DNA binding 
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domain of the Gal4 protein which is bound to the UAS G upstream of the His3 and 
lacZ genes on the chromosome. 

In this two-hybrid system, Y190 cells were transformed with the expression 
plasmid pAS2-Skpl using lithium acetate according to standard techniques (F.M. 
Ausubel, et aL, Short Protocols in Molecular Biology, John Wiley & Sons, New York 
[1992], pp. 13-29-13,30). The expression plasmid, pAS2-Skpl encodes a fusion 
protein comprising a hybrid between the DNA-binding domain of the yeast 
transcription factor Gal4 (amino acids 1-147) and the Skpl molecule. This first hybrid 
acts as "bait" for the second hybrid molecule; the Gal4/Skpl hybrid binds to the 
upstream activating sequence from GAL1 (UAS G ) sequences located upstream of the 
His3 and lacZ genes in the host cell chromosome. Because the GAL4-Skpl hybrid 
lacks trans-activating sequences. Y190 cells transformed with pASl-Skp 1 were His" 
and white. 

Y190 cells were transformed with the pAS2-Skpl plasmid as follows. Y190 
cells were grown in 5 ml of YPD medium (10 g/1 yeast extract, 20 g/1 peptone and 20 
g/1 dextrose) overnight to saturation at 30°C. The next day, a liter sterile flask 
containing 300 ml of YPAD medium (YPD containing 30 mg/1 adenine hemisulfate) 
was inoculated with the overnight culture and grown overnight at 30°C to a density of 
1 x 10 7 cells/ml. The cells were then collected by centrifugation at 4000 x g for 5 min 
at room temperature. The cell pellet was then washed by resuspending the cells in 10 
ml sterile H 2 0 followed by centrifugation at 5000 x g for 5 min at room temperature. 
The washed cells were resuspended in 1.5 ml of LiAcTE (1 vol 10X TE buffer [100 
mM Tris-HCI, 10 miM ED.TA], pH 7.5), plus 1 vol of 10X Li Ac stock solution (1M 
lithium acetate, pH 7.5) plus 8 vol sterile H 2 0. Five micrograms of pAS2-Skpl DNA 
and 200 \ig carrier DNA (single-stranded, high molecular weight carrier DNA was 
prepared from salmon sperm DNA usihg^standard protocols; yeast total RNA may also 
be used as a carrier) were placed in a sterile 1.5. ml microcentrifuge tube in a total 
volume of 20 Two hundred microliters of the yeast suspension was added to the 
Jilbei^lowed^y^the^addition of 1.2 ml of a LiAcPEG solution (8 vol of 50% (w/v) 
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polyethylene glycol, MW 3350 plus 1 vol of I OX TE buffer, pH 7.5 plus I vol 10X , 
LiAc stock solution). The cells were then shaken for 30 min at 30°C, followed by a 
heat shock (15 min at 42 C C). Following the heat shock, the cells were collected by 
centrifugation for 5 sec at room temperature in a tabletop microcentrifuge. The cell 
5 pellet was the resuspended in 1 ml of TE buffer and 200 p.1 of the suspension were 

spread onto SC-Trp medium. 

The transformed Y190 cells (Y190/pAS2-Skpl) were then transformed with a 
pACTII-human breast tissue cDNA library as described below. The plasmids 
contained within this library encode the second hybrids of the two-hybrid system. The 

10 second hybrids comprised fusion proteins containing the sequences encoding the Gal4 

activation domain II (amino acids 768-881) fused to a cDNA library, generated from 
human breast tissue. When a Y190/pAS2-Skpl cell is transformed with a pACTII 
expression vector capable of expressing a protein or portion of a protein which can 
bind to the Gal4-Skpl hybrid, transcription of the- His3 and LacZ genes is activated as 

15 the binding of the second hybrid brings the Gal4 activation domain II in close 

proximity to the DNA binding domain .of the GaI4 protein which is bound to the 
UAS G upstream of the His3 and lacZ genes on the chromosome. 

Y190/pAS2-Skpl cells were transformed with the pACTII-human breast cDNA 
library as follows. Briefly, the recipient strain (Y190/pAS2-Skpl cells) were grown to 

20 mid-log phase (1 x 10 7 cells/ml) in SC-Trp medium (SC medium lacking tryptophane). 

The OD 600 of this culture was determined and 1 liter of YPD medium was inoculated 
with enough of the culture such that in 2 generations the cell density became. 1 x 10 7 
cells/ml. The cells where pelleted by centrifugation and the pellet was resuspended in 
LiAcTE (the volume is not critical as this is a wash step). The cells were pelleted by 

25 centrifugation and the cells were resuspended in 25 ml of LiSORB (100 mM LiAc, 10 

mM Tris-HCl (pH 8.0), 1 mM EDTA, 1 M sorbitol). The cells were then incubated 
for 30 min at 30°C with shaking. The cells were then pelleted by centrifugation as 
described above and resuspended in 2.5 ml of LiSORB.' After removing 100 |il of 
cells for a negative control, 50 jag of pACTII library DNA and 5 mg of yeast total 

30 RNA carrier, was added. The mixture was mixed well and then incubated for .10 min~ 
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at 30°C without shaking. The cells were then transferred to a 250 ml flask and 22.5 * 
ml of LiAcPEG (LiAcTE containing 40% polyethylene glycol, MW 3350) was added 
and the suspension was well mixed. The flask was then placed in a 42°C water bath 
for 12 min to heat shock the cells. Following the heat shock, the transformation 
mixture was added to 500 ml of SC-Trp, -Leu, -His medium and the culture was 
allowed to recover at 30°C for 4 hours; at this point the cells are established as 
transformants. Next, 4 x 10 5 transformants were obtained by transformation of 1 x 
10 10 Y190/pAS2-Skpl cells with 50 ng of pACTII library DNA. 

Transformants were subjected to selection for histidine prototrophy by plating 
300 \il of the culture on 15 cm petri dishes containing SC-Trp, -Leu, -His medium 
containing 50 mM 3 amino-triazole (Sigma), and incubated for 30°C for 3-5 days. 

The rare surviving colonies were screened for their ability to produce (3- 
galactosidase using a filter lift assay (L. Breeden and K. Nasmyth, Cold Spring Harbor 
Symp. Quant. Biol., 50:643-650 [1985]). Briefly, colonies were transferred onto 
nitrocellulose filters (Scheicher and Schuell, BA85 45 circular filters) by laying the 
filters onto plates containing the yeast colonies and allowing the filter to wet 
completely. The filters were lifted off the plates and then submerged in liquid 
nitrogen for 5-10 sec. The filters were then placed cell side up into a petri dish 
containing 3 MM chromatography paper (Whatman) saturated 0.3 ml/square inch with a 
solution comprising 100 mM sodium phosphate. pH 7.0, 10 mM KG. 0.7 mM 
magnesium sulfate, 10 mM 2-mercaptoethanol and 1 mg/ml X-gal (5-bromo-4-chloro- 
3-indolyl-p-galactoside). The filters were incubated at 30°C until blue color developed 
(30 min to overnight). . Positive (/.<?., blue) colonies were then patched onto a master 
plate for further analysis. 

Plasmids recovered from 20 His+ blue (i.e., lacZ expressing) colonies were 
sequenced from their 5' ends using the chain termination method in conjunction with 
the Sequenase® enzyme (U.S. Biochemicals); sequencing was performed according the 
manufacturer's instructions. The amino acid sequence was deduced from the 
nucleotide sequences located within the 20 inserts and were compared to„sequences 
listed in the GenBank. One of these 20 clones were found to be related to the F-box 
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protein met30 from S. cerevisiae in the F-box domain. The F-box region of Fgamm^ 
was used to search the EST database of Genbank, and identified two novel mammalian 
F-box-containing cDNAs (F13 Omicron and F14 Pi), in addition to two F-box proteins 
from C elegans (C02FS7 and YK18A11). The Genbank accession numbers for these 
four cDNAs are AA422959, AA462249, R12719, and D35163). 

These novel F-box sequences were used to search the EST sequences listed in 
Genbank. This search yielded F7 Theta (AA008567), Fl alpha (F12916), F4 Delta 
(AA167804), TRCP (AA478504), F6 Eta (AA027176), F15 Rho (AA538102), FS Iota 
(AA295683), MD6 (AA145853), and Skp2 homologs (U33761). These F-box protein 
sequences were then used in additional EST database searches and yielded sequences 
for F2 Beta (H58S48), F5 Zeta (R17328), F9 Kappa (AA459120), F10 Lambda 
(AA501293), Fll Mu (AA069757), F12 Nu (AA000239), F16 Sigma (H49462), F17 
Tau (AA381895), and F18 Phi (AA309734). cDNA clones in bacterial plasmids 
including pBluescript (Stratagene) were retrieved from the EST cDNA Image 
Consortium and subjected to sequence analysis using dideoxy DNA sequencing (See 
e.g., Sambrook et ai, supra) to verify the F-box sequences. For three cDNAs (Fl 
alpha, F2 beta, and F4 delta, complete sequences of the available cDNA clones were 
obtained by standard primer walking (See e.g., Sambrook ei al, supra). 

This approach finds use in identification of other novel F-bpx containing 
proteins, either using cDNA libraries from other tissues or by performing additional 
database searches. The use of cDNA libraries from various tissues allows the 
identification of F-box proteins that are cell-type specific. Additional alternative 
approaches, such as expression screening of Xgtl I -based plaque libraries with Skpl 
protein are also contemplated as methods for yielding novel F-box proteins. 
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Table 2. F-Box Sequences Identified 



"Mamp Source 

&. Genbank # 


Amino Acid Sequence 


SEQ ID 
NO: 


TRCP 

f Vi i imfln^ 
li lull lull J 

AA478504 


LPARGLDHIAENILSYLDAKSLCAAELVCKEWYRVTS 
DGMLW 


SEQ ID 
NO:l 


(human) 
F12916 


LPKELLLRIFSFLDIVTLCRCAOISKAWNILALDGSNW 


SEQ ID 
NO:3 


(human) 
H58848 


T PYFT IOI II NHLTLPDLCRLAOTCKLLSOHCCDPLOY 


SEQ ID 
NO:5 


F2 (Beta) 
(mouse) 


LPYELIQLILNHLSLPDLCRLAQTCRLLHQHCCDPLQY 


SEQ ID 
NO: 7 


r J ^vjrduiniay 

(human) 
AA 10 1399 


T PTDPI-I f II SFI DYRDL INCCYVSRRLSOLSSHDPLW 


SEQ ID 
NO:9 


F3 (Gamma) 
(mouse) 


LPTDPLLLIVSFVDYRDLINCCYVSRSVSQLSTHDPLW 


SEQ ID 
NO: 1 1 


(human) 
AA167804 


t PPFVMT SIFSYT NPOFT CRCSOVSMKWSOLTKTGSL 
W 


SEQ ID 
NO: 13 


(mouse) 


r PPFVMT SIFSYT NPOFT rRPSOVSTKWSOT AKTGSL 

w 


SEQ ID 
NO:15 


F5 (Zeta) 

(human) 

R17328 


LPLEMLTYILSFLPLSDQKEASLVSWAWYRAAQNALR 
ERLW 


SEQ ID 
NO: 17 


F6 (Eta) - - _ 

(human) 

AA027176 


LPPELSFTILSYLNATpLCLASCVWQDLANDELLW 


SEQ ID 
NO: 19 


F6 (Eta) 
(mouse) 
AA2 13046 


LPPELSFTILSYLNAIDLCLASCVWQDLANDELLW 


SEQ ID 
NO:21 
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Table 2. F-Box Sequences Identified 





Name, Source 
& Genbank # 


Amino Acid Sequence 


SEQ ID' 
NO: 




F7 (Theta) 

(mouse) 

AA008567 


LPRVLSVYIFSFLDPRSLCRCAQVSWYWKSLAELDQL 
W 


SEQ ID 
NO:23 


5 


FS (Iota) 
(human) 
AA295683 


LPIDVQLYILSFLSPHDLCQLGSTNHYWNETVRHPILW 


SEQ ID 
NO:25 




F9 (Kappa) 

(human) 

AA459120 


LPLELWRMILAYLHLPDLGRCSLVCRAWYELILSLDST 
RW 


SEQ ID 
NO:27 


10 


F10 (Lambda) 

(mouse) 

AA50I293 


LPAEITFKIFSQLDIRSLCRASLTCRSWNDFKS 


SEQ ID 
NO:29 


15 


Fll (Mu) 

(mouse) 

AA069757 


LPLLQQPLLCSVAHPIASFTMLSYLTGKEAAHLSVELW 


SEQ ID 
NO:31 




F12 (Nu) 

(mouse) 

AA000239 


LPDSLVYQIFLSLGPADVLAAGLVCRQWQAVSRJDEFL 
W 


SEQ ID 
NO:33 


20 


F13 

(Omicron) 

(mouse) 

AA422959 


LPEEVLALIFRDLPLRDLAVATRVCRAWAAA 


SEQ ID 
NO:35 


25 


F14 (Pi) 
(mouse) 
AA462249 


LPSYPMMEILSYLDAYSLLQAAQVNKNWNELASSDV 
LW 


SEQ ID 
NO:37 




F15 (Rho) 

(mouse) 

AA538102 


MPSEILVKILSYLDAVTLVCIGCVSRRFYHLADDNLIW 


SEQ ID 
NO:39 


30 


F16 (Sigma) 

(human) 

AA49462 


LPMEVLMYIFRWVVSSDLDLRSLEQLSLVCRGFYICA 
RDPEIW 


SEQ ID 
NO:41 
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Table 2. F-Box Sequences Identified 



Name, Source 
& Genbank # 


Amino Acid' Sequence 


SEQ ID 
NO: 


F16 (Sigma) 
(mouse) 
AA4 10485 


LSLVCRGFYICARDPEIW 


SEQ ID 
NO:43 


F17 (Tau) 

(human) 

AA381895 


LPYELAINIFXYLDRKELGRCAQVSKTWEGD 


SEQ ID 
NO:45 


F18 (Phi) 

(human) 

AA309734 


LPLELKLPJFRLLDVRSVLSLSAVCRDLFTASNDPLLW 


SEQ ID 
NO:47 


F.18 (Phi) 

(mouse) 

W20645 


LPLELKLRJFRLLDVHSVLALSAVCHDLLIASNDPLLW . 


SEQ ID 
NO:49 


MD6 
(human) 


LPLELSFYLLKWLDPQTLLTCCLVSKQWNKVISACTE 
VW 


. SEQ ID 
NO:57 
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Table 3. F-Box Sequences Identified 



Name and 
Source 


DNA Sequence 


SEQ ID 
NO: • 


TRCP 
(human) 


CTGCCAGCTCGGGGATTGGATCATATTGCTGAGAAC 
ATTCTGTCATACCTGGATGCCAAATCACTATGTGCT 
GCTGAACTTGTGTGCAAGGAATGGTACCGAGTGACC 
TCTGATGGCATGCTGTGG 


SEQ ID 
NO:2 


Fl (Alpha) 
(human) 


TTACCCAAAGAACTTCTGTTAAGAATATTTTCCTTCT 
TGGATATAGTAACTTTGTGCCGATGTGCACAGATTT 
CCAAGGCTTGGAACATCTTAGCCCTGGATGGAAGCA 
ACTGG 


SEQ ID 
NO:4 


F2 (Beta) 
(human) 


CTACCTTATGAGCTTATTCAGCTGATTCTGAATCATC 
TTACACTACCAGACCTGTGTAGATTAGCACAGAC 


SEQ ID 
NO:6 


F2 (Beta) 
(mouse) 


CTACCATATGAGCTCATTCAACTGATTCTGAATCAT 
CTTTCACTACCAGACCTGTGTAGATTAGCCCAGACT 
TGCAGGCTTCTCCACCAGCATTGCTGTGATCCTCTG 
CAATAT 


SEQ ID 
NO:S 


F3 (Gamma) 
(human) 


CTGCCCACCGATCCCCTGCTCCTCATCTTATCCTTTT 
TGGACTATCGGGATCTAATCAACTGTTGTTATGTCA 
GTCGAAGAGTTAGCCAGCTATCAAGTCATGATCCGC 
TGTGG 


SEQ ID 
NO: 10 


F3 (Gamma) 
(mouse) 


CTACCCACCGACCCTCTGCTCCTCATAGTATCCTTCG 
TGGACTACAGGGACCTAATCAATTGTTGCTATGTTA 
GTCGAAGCGTTAGCCAGCTATCAACTCATGATCCAC 
TGTGG 


SEQ ID 
NO: 12 


F4 (Delta) 
(human) 


CTTCCTCCTGAGGTAATGCTGTCAATTTTCAGCTATC 
TTAATCCTCAAGAGTTATTCGATGCAGTCAAGTAAG 
CATGAAATGGTCTCAGCTGACAAAAACGGGATCGCT 
TTGG 


SEQ ID 
NO:14 


■ -- 

F4 (Delta) 
(mouse) 


CTTCCTCCTGAGGTAATGCTGTCCATTTTCAGTTACC 
TTAATCCTCAAGAATTGTGTCGGTGTAGTCAAGTCA 
GTACTAAGTGGTCTCAGCTGGCAAAAACAGGATCTT 
TGTGG 


SEQ ID 
NO:l6 


F5 (Zeta) 
(human) 


CTGCCCCTGGAGATGCTCACATATATTCTGAGCTTC 
CTGCCTCTGTCAGATCAGAAAGAGGCCTCCCTCGTG 
AGTTGGGCTTGGTACCGTGCTGCCCAGAATGCCCTT 


SEQ ID 
NO: 1 8 




CGGGAGAGGCTGTGG 
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Table 3. F-Box Sequences Identified 



Name and 
Source 


DNA Sequence 


SEQ id ; 

' NO;.: 


F6 (Eta) 
(human) 


TTGCCTCCTGAGCTAAGCTTTACCATCTTGTCCTACC 
TGAATGCAACTGACCTTTGCTTGGCTTCATGTGTTTG 
GC AG G ACCTTGCG AATG ATG A ACTTCTCTGG 


SEQ ID 
NO:20 


F6 (Eta) 
(mouse) 


CTGCCTCCTGAGCTGAGCCTCACCATCCTATCCCAC 

CTGGATGCAACTGACCTTTGCCTAGCTTCCTGTGGTT 

GGCAAGAACTCGCTAATGATGAACTTCTCTGG 


SEQ ID 
NO:22 


F7 (Theta) 
(mouse) 


CTTCCAAGGGTGTTATCTGTCTACATCTTTTCCTTCC 
TGGATCCCCGGAGTCTTTGCCGTTGTGCACAGGTGA 
GCTGGTACTGGAAGAGCTTGGCTGAGTTGGACCAGC 
TCTGG 


SEQ ID 
NO:24 


F8 (Iota) 
(human) 


CTGCCGATTGATGTACAGCTATATATTTTGTCCTTTC 
TTTCACCTCATGATCTGTGTCAGTTGGGAAGTACAA 
ATCATTATTGGAATGAAACTGTAAGACATCCAATTC 
TTTGG 


SEQ ID 
NO:26 


F9 (Kappa) 
(human) 


CTCCCCTTGGAGCTGTGGCGCATGATCTTAGCCTAC 
TTGCACCTTGCCGACCTGGGCCGCTGCAGCCTGGTA 
TGCAGGGCGTGGTATGAACTGATCCTCAGTCTCGAC 
AGCACCCGCTGG 


SEQ ID 
NO:28 


F.10 (Lambda) 
(mouse) 


CTGCCTGCAGAAATCACTTTTAAAATTTTCAGTCAG 
CTGGACATTCGGAGTCTGTGCAGGGCTTCATTGACA 
TGCAGGAGCTGGAATGAC 


SEQ ID 
NO:30. 


Fll (Mu) 
(mouse) 


CTGCCATTACTGCAGCAGCCACTTCTGTGTTCTGTG 
GCTCATCCCATCGCCAGCTTCACCATGCTGTCATAC 
CTCACGGGAAAGGAGGCCGCTCATCTGTCAGTGGA 
GTTGTGG 


SEQ ID 
NO:32 


F12 (Nu) 
(mouse) 


CTCCCCGACAGCCTTGTCTACCAGATCTTCCTGAGTT 
TGGGCCCTGCAGATGTGCTGGCTGCTGGGCTGGTAT 
GCCGCCAATGGCAGGCTGTGTCCCGGGATGAGTTCT 
TATGG 


SEQ ID 
NO. 34 


F13 

(Omicron) 
(mouse) 


CTGCCAGAGGAAGTGTTGGCGCTCATCTTCCGTGAC 
CTGCCTCTCAGGGACCTTGCTGTAGCCACCAGAGTC 
TGCAGGGCCTGGGCGGCGGCT 


SEQ ID 
NO:36 
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Tabic 3. F-Box Sequences Identified 



Name, and 
Source 


DNA Sequence 


SEQ ID 
NO: 


F14 (Pi) 
(mouse) 


TTACCTAGTGTGCCGATGATGGAAATCCTCTCCTAT 
CTGGATGCCTACAGTTTGCTACAGGCTGCCCAAGTG 
AACAAGAACTGGAATGAAGTTGCAAGCAGTGATGT 
CCTGTGG 


SEQ ID 
NO:38 


F15(Rho) 
(mouse) 


ATGCCATCGGAAATCTTGGTGAAGATACTTTCTTAC 
TTGGATGCGGTGACCTTGGTGTGCATTGGATGTGTG 
AGCAGACGCTTTTATCATTTGGCTGATGACAATCTT 
ATTTGG 


SEQ ID 
NO:40 


F16 (Sigma) 
(human) 


CTGCCAATGGAGGTCCTGATGTACATCTTCCGATGG 
GTGGTGTCTAGTGACTTGGACCTCAGATCATTGGAG 
CAGTTGTCGCTGGTGTGCAGAGGGTTCTACATCTGT 
GCCAGAGACCCTGAAATATGG 


SEQ ID 
NO:42 


F16 (Sigma) 
(mouse) 


GACTTGGACCTCAGATCGTTAGAGCAGTTGTCACTG 
GTGTGCAGAGGATTCTATATCTGTGCCAGAGACCCT 

GAAATCTGG 


SEQ ID 
NO:44 


F17 (Tau) 
(human) 


CTGCCTTACGAATTGGCAATCAATATATTTNAGTAT 
CTGGACAGGAAAGAACTAGGAAGATGTGCACAGGT 
GAGCAAGACGTGGGAAGGTGATT 


SEQ ID 
NO:46 


F18 (Phi) 
(human) . 


CTCCCATTGGAACTGAAACTACGGATCTTCCGACTT 
CTGGATGTTCGTTCCGTCTTGTCTTTGTCTGCGGTTT 
GTCGTGACCTCTTTACTGCTTCAAATGACCCACTCCT 

GTGG 


SEQ ID 
NO:48 


F18 (Phi) 
(mouse) 


CTTCCAC 1 CiuAuL 1 uAAAL 1 ALULA I L. l H-^Urt^ i l 
TTGGATGTTCATTCTGTCCTGGCCCTGTCTGCAGTCT 
GTCATGACCTCCTCATTGCGTCAAATGACCCACTGC 
TGTGG - 


SFO ID 

Oby IIS 

NO:50 


MD6 
(human) 


CTTCCCCTGGAGCTCAGTTTTTATTTGTTAAAATGGC 
TCGATCCTCAGACTTTACTCACATGCTGCCTCGTCTC 
TAAACAGTGGAATAAGGTGATAAGTGCCTGTACAG 
AGGTGTGG 


SEQ ID 
NO:58 
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Tabic 4. Sequences of Some F-Box Proteins 



Name & 
Source 


Sequence 


SEQ ID 
NO: 


Fl Alpha 
(human) 


SAMVFSNNDEGLINKKLPKELLLRIFSFLDIVTLCRCA 

QISKAWNILALDGSNWQRIDLFNFQIDVEGRVVENISK 

RCGGFLRKLSLRGCIGVGDSSLKTFAQNCRNIEHLNLN 

GCTKITDSTCYSLSRPCSKLKHLDLTSCVSITNSSLKGI 

SEGCRNLEYLNLSWCDQITKDGIEALVRGCRGLKALL 

LRGCTQLEDEALKHIQNYCHELVSLNLQSCSRITDEGV 

VQICRGCHRLQALCLSGCSNLTDASLTALGLNCPRLQI 

LEAARCSHLTDAGFTLLARNCHELEKMDLEECILITDS 

TLIQLSIHCPKLQALSLSHCELITDDGILHLSNSTCGHE 

RLRVLELDNCLLITDVALEHLETAEAWSASSCTTASRL 

PVQASSGCGLSSLMSKSTPTLLPSPHRQQWQEVDSDC 

AGAVSFSDSSCLGPRGDEASFPLEDLSLPDRLHHHPIC 


SEQ ID 
NO:5i 
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Table 4. Sequences of Some F-Box Proteins 



Name & 
Source 


Sequence 


SEQ ID 
NO: 


Fl Alpha 
(human) 


TTCGGCCATGGTTTTCTCAAACAATGATGAAGGCCT 

TATTAACAAAAAGTTACCCAAAGAACTTCTGTTAAG 

AATATTTTCCTTCTTGGATATAGTAACTTTGTGCCGA 

TGTGCACAGATTTCCAAGGCTTGGAACATCTTAGCC 

CTGGATGGAAGCAACTGGCAAAGAATAGATCTTTTT 

AACTTTCAAATAGATGTAGAGGGTCGAGTGGTGGA 

AAATATCTCGAAGCGATGCGGTGGATTCCTGAGGA 

AGCTCAGCTTGCGAGGCTGCATTGGTGTTGGGGAJT 

CCTCCTTGAAGACCTTTGCACAGAACTGCCGAAACA 

TTGAACATTTGAACCTCAATGGATGCACAAAAATCA 

CTGACAGCACGTGTTATAGCCTTAGCAGATTCTGTT 

CCAAGCTGAAACATCTGGATCTGACCTCCTGTGTGT 

CTATTACAAACAGCTCCTTGAAGGGGATCAGTGAGG 

GCTGCCGAAACCTGGAGTACCTGAACCTCTCTTGGT 

GTGATCAGATCACGAAGGATGGCATCGAGGCACTG 

GTGCGAGGTTGTCGAGGCCTGAAAGCCCTGCTCCTG 

AGGGGCTGCAGACAGTTAGAAGAT-GAAGCTCTGAA 

ACACATTCAGAATTACTGCCATGAGCTTGTGAGCCT 

CAACTTGCAGTCCTGCTCACGTATCACGGATGAAGG 

TGTGGTGCAGATATGCAGGGGCTGTCACCGGCTACA 

GGCTCTCTGCCTTTCGGGTTGCAGCAACCTCACAGA 

TGCCTCTCTTACAGCCCTGGGTTTGAACTGTCCGCG 

ACTGCAAATTTTGGAGGCTGCCCGATGCTCCCATTT 

GACTGACGCAGGTTTTACACTTTTAGCTCGGAATTG 

CCACGAATTGGAGAAGATGGATCTTGAAGAATGCA 

TCCTGATAACCGACAGCACACTCATCCAGCTCTCCA 

TTrArTGTCCTAAACTGCAAGCCCTGAGCCTGTCCC 

ACTGTGAACTCATCACAGATGATGGGATCCTGCACC 

TGAGCAACAGTACCTGTGGCCATGAGAGGCTGCGG 

GTACTGGAGTTGGACAACTGCCTCCTCATCACTGAT 

GTGGCCCTGGAACACCTAGAAACTGCCGAGGCCTG 

GAGCGCCTCGAGCTGTACGACTGCCAGCAGGTTACC 

CGTGCAGGCATCAAGCGGATGCGGGCTCAGCTCCCT 

CATGTCAAAGTCCACGCCTACTTTGCTCCCGTCACC 

CCACCGACAGCA 


SEQ ID 
NO:52 
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Table 4. Sequences of Some F-Box Proteins 



Name & 
Source 


Sequence 


SEQ ID 
. "NO: 


F2 Beta 
(human) 


RPRFGTSDIEDDAYAEKDGCGMDSLNKKFSSAVLGEG 

PNNGYFDKLPYELIQLILNHLTLPDLCRLAQTCKLLSQ 

HCCDPLQYIHLNLQPYWAKLDDTSLEFLQSRCTLVQW 

LNLSWTGNRGFISVAGFSRFLKVCGSELVRLELSCSHF 

LNETCLEVISEMCPNLQALNLSSCDKLPPQAFNHIAKL 

CSLKRLVLYRTKVEQTALLSILNFCSELQHLSLGSCVM 

IEDYDVIASMIGAKCKKLRTLDLWRCKNITENGIAELA 

SGCPLLEELDLGWCPTLQSSTGCFTRLAHQLPNLQKLF 

LTANRSVCDTDIDELACNCTRLQQLDILGKVTIYKFVL 

NVCFLDRKANLRLFVRKKKIFGYNKNFILIRWLGLIGN 

AR 


SEQ ID 
NO:53 
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Table 4. Sequences of Some F-Box Proteins 



Name & 
Source 


Sequence 


SEQ ID 
NO: 


F2 Beta 
(human) 


AGGCCAAGATTCGGCACGAGTGATATAGAAGATGATGCCTATG 

CAGAAAAGGATGGTTGTGGAATGGACAGTCTTAACAAAAAGT 

TTAGCAGTGCTGTCCTCGGGGAAGGGCCAAATAATGGGTATTT 

TGATAAACTACCTTATGAGCTTATTCAGCTGATTCTGAATCATC 

TTACACTACCAGACCTGTGTAGATTAGCACAGACTTGCAAACT 

ACTGAGCCAGCATTGCTGTGATCCTCTGCAATACATCCACCTCA 

ATCTGCAACCATACTGGGCAAAACTAGATGACACTTCTCTGGA 

ATTTCTACAGTCTCGCTGCACTCTTGTCCAGTGGCTTAATTTAT 

CTTGGACTGGCAATAGAGGCTTCATCTCTGTTGCAGGATTTAG 

CAGGTTTCTGAAGGTTTGTGGATCCGAATTAGTACGCCTTGAA 

TTGTCTTGCAGCCACTTTCTTAATGAAACTTGCTTAGAAGTTAT 

TTCTGAGATGTGTCCAAATCTACAGGCCTTAAATCTCTCCTCCT 

GTGATAAGCTACCACCTCAAGCTTTCAACCACATTGCCAAGTT 

ATGCAGCCTTAAACGACTTGTTCTCTATCGAACAAAAGTAGAG 

CAAACAGCACTGCTCAGCA1 i 1 1 G AACTTCTGTTC AG AGCTTC A 

GCACCTCAGTTTAGGCAGTTGTGTCATGATTGAAGACTATGAT 

GTGATAGCTAGCATGATAGGAGCCAAGTGTAAAAAACTCCGG 

ACCCTGGATCTGTGGAGATGTAAGAATATTACTGAGAATGGAA 

TAGCAGAACTGGCTTCTGGGTGTCCACTACTGGAGGAGCTTGA 

CCTTGGCTGGTGCCCAACTCTGCAGAGCAGCACCGGGTGCTTC 

ACCAG'ACTGGCACACCAGCTCCCAAACTTGCAAAAACTCTTTC 

TTACAGCTAATAGATCTGTGTGTGACACAGAC'ATTGATGAATT 

GGCATGTAA T I U f ALLAuu 1 I ALAULAOL l uoaca i t\ i i mow i 

AAGGTTACAATATATAAATTTG 1 1 i 1 AAATGTCTGTTTCCTTGA 

CAGAAAAGCCAATCTCAGACTTTTTGTTAGGAAAAAGAAAATT 

TTTGGATACAATAAAAATTTTATCCTGATAAGATGGCTTGGTTT 

GATAGGAAATGCCAGATAGATCAGTTAATATAGGGAATAATTA 

TATATGTACTTTAATAAAATAGTGAGGACAATAACAATTTTAT 

AGTTGAACTGTAAAAAACTATAACCATTAATTCTTGGTCTACTT 

GTAAGAGTGAGAATTTACATGAGCTGCGCTCTCTATTTTTATTA 

AGGAGAGAAGAAATTAATTCAtTTG.TATAATGAATTCAAGCTA 

GTTTTTTTTAAGTTTCTTAATTAAGCGGCCGCAAGCTTA 


SEQ ID 
NO:54 



- 117 - 



WO 99/18989 



PCTAJS98/21763 



Table 4. Sequences of Some F-Box Proteins 



Name & 
Source 


Sequence 


SEQ ID 
NO: 


F4 Delta 
(human) 


MVIML2ERQKFFKYSVDEKSDKEAEVSEHSTGITHLPPEVMLSIFS 

YLNPQELCRCSQVSMKWSQLTKTGSLWKHLYPVHWARGDWYS 

GPATELDTEPDDEWVKNRKDESRAFHEWDEDADIDESEESAEESI 

AISIAQMEKRLLHGLIHNVLPYVGTSVKTLVLAYSSAVSSKMVRQ 

ILELCPNLEHLDLTQTDISDSAFDSWSWLGCCQSLRHLDLSGCEKI 

TDVALEK3SRALGILTSHQSGFLKTSTSKITSTAWKNKDITMQSTK 

QYACLHDLTNKGIGEEIDNEHPWTKPVSSENFTSPYVWMLDAED 

LAD1EDTVEWRHRNVESLCVMETASNFSCSTSGCFSKDIVGLRTS 

VCWQQHCASPAFAYCGHSFCCTGTALRTMSSLPESSAMCRKAAR 

TRLPRGKDLIYFGSEKSDQETGRVLLFLSLSGCYQITDHGLRVLTL 

GGGLPYLEHLNLSGCLTITGAGLQDLVSACPSLNDEYFYYCDNIN 

GPHADTASGCQNLQCGFRACCRSGE 


SEQ ID 

NO:55 
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Table 4. Sequences of Some F-Box Proteins 



Name & 
Source 


Sequence 


SEQ ID 
NO: 


F4 Delta 
(human) 


atggtaatcatgctgtaagagcgacagaaattttttaaatattccgtggatgaa 

aagtcagataaagaagcagaagtgtcagaacactccacaggtataacccatct 

tcctcctgaggtaatgctgtcaa 1 i i i cagctatctt/\atcctcaagagttatgt 

cgatgcagtcaagtaagcatgaaatggtctcagctgacaaaaacgggatcgct 

ttg g aaa c a tcttt a cc ctgttc a ttgg gc c a g a ggtg a ctggt at a gtggtcc 

cgcaactgaacttgatactga-acctgatgatgaatgggtgaaaaataggaa.ag 

atgaaagtcgtgci 11 ! catgag i gggatgaagatgctgacattgatgaatctg 

aagagtctgcggaggaatcaattgctatcagcattgcacaaatggaaaaacgt 

tt a ctc c atg gctt a attc a t a a c gttct a cca t atgttggt a cttctgt a a aaa 

ccttagtattagcatacagctctgcagtttccagcaaaatggttaggcagattt 

tagagctttgtcctaacctggagcatctggatcttacccagactgacatttcag 

attctgcatttgacagttggtcttggcttggttgctgccagagtcttcggcatct 

tgatctgtctggttgtgagaaa.atcacagatgtggccctagagaagatttccag 

agctcttggaattctgacatctcatcaaagtggctttttga 

caaaattacttcaactgcgtggaaaaataj\agacattaccatgcagtccacc/\a 

gcagtatgcctgttrgcacgatttaact/v\caagggcattggagaagaaatag 

ataatg a acacccctggacta agcctg i ' i ' i c i 1 ctgag aatttc acttctcctt a 

tgtgtggatgttagatgctgaagatttggctgatattgaagatactgtggaatg 

gagacatagaaatgttgaaag1c1 1 i gtgtaatggaaacagcatcc aactttag 

ttgttccacctctggttg i 1 i'i agtaaggacattgttggactaaggactagtgtc 

tgttggcagcagcattgtgcttctccagcctttgcgtattgtggtcactcai 1 1 1 

gttgtacaggaacagctttaagaactatgtcatcactcccagaatcttctgc.aa 

tgtgtagaaaagcagcaaggactagattgcctaggggaaaagacttaatttac 

tttg g g a gtg a aaaa tctg a tc aa gaga ctg g a cgtgt a cttctgtttctc a gt 

ttatctggatgttatcagatcacagaccatggtctcaggg ii 1 1gactctggga 

ggagggctgccrtatttggagcaccttaatctcrcrggttgtcttactata\ctg 

gtgcaggcctgcaggatttggtttcagcatgtccttctctgaatgatgaatact 

tttactactgtgacaacattaacggtcctcatgctgataccgccagtggatgcc 

agaatttgcagtgtggttttcgagcctgctgccgctctggcgaatgacccttga 

cttctgatci" 1' i g 1 c 1 ac i i cat! 1 agc i gagcaggc 1 1 1ci i 1 catgcactttac 

tcatagcacat i 1 c 1 i gtgttaaccatccc i 1 1 ! igaglgigati i gi 11 tggcc 

ccattrcttacaacttcagaaatcttaatttaccagtgaattgtayxtgttgtttc 

TCTTGCAAATTATAL 1 I 1 1 GG 1 i i AG AAAGGGATTAGGTCTTTTCAAAAGGGTG 
A G AA C A GTCTT A C A TTTTTCTTTT A A A TG A AA TGCTTT A A A G Ay\ TGTTG GT A-\ T 
G C C A TGTC A F A A A CiT A TTTC A T A G A T A A TTTTG A GTTTT AAA G TCC A TG G A G 
GTG A TTGG i 1 C J C 1 1 1 A C AC A TT AAC A CFG TA CCA A GCTTTG C A G A TCTTTTC C G 
ACACACATGTCTGAAGACTTA 1 1 I 1 C AAAGACAGCACA 1 1 1 1 IGGAAACTAATC 
TC! II lCCGTAAiAl 11CC1I i ATTTC A A TG A TTCTC AGAAGGCC AA TTC AA A C A 
AACCCACATTl'AAGGTTCTTTAGGATTATAGAATAAATTGGCTTCTGAGTGTTA 
GCTCAGTGAGTAGGAAAGCACCAATCGATA I"l IGI I I cctttagggatactttg 
TTCTCACCACTGTCCCrATGTCATCAAATTTGGGAGAGA i l l 1 1 1 AAAATACCAC 
A.A TC A TTTG A A G A AA TG T A T AA A" f.Ay\A ATCT A CTTTG A GG A CTTT A CC A AG T A .A - 


SEQ ID 
NO:56 



It is clear from the above that the present invention provides F-box proteins 
and other compositions and methods for the development of compositions capable of 
affecting proteolysis, as well as methods and compositions for the investigation and 
-identification_of_ addition^F^bo^ proteins. 



- 119 - 



WO 99/13989 



PCTAJS93/21763 



All publications and patents mentioned in the above specification are herein , 
incorporated by reference. Various modifications and variations of the described 
method and system of the invention will be apparent to those skilled in the art without 
departing from the scope and spirit of the invention. Although the invention has been 
5 described in connection with specific preferred embodiments, it should be understood 

that the invention as claimed should not be unduly limited to such specific 
embodiments. Indeed, various modifications of the described modes for carrying out 
the invention which are obvious to those skilled in molecular biology or related fields 
are intended to be within the scope of the following claims. 
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SEQUENCE LISTING 



<110> Harper, Jeffrey W. 

Elledge , Stephen J. 

<120> F-BOX PROTEINS AND GENES 

<130> BCM-03510 

<140> 
<141> 

<150> 08/951,621 
<151> 1597-10-16 

<160> 60 

<170> Patentln Ver. 2.0 

<210> 1 

<211> 42 

<212> PRT 

<213> Hcmo sapiens 



<400> I 

Leu Pro Ala Arc Glv Leu Asp His lie Ala Giu Asn lie Leu Ser Tyr 
1 ~ 5 . - 10 15 

Leu Aso Aia Lvs Ser Leu Cvs Ala Ala Glu Leu Val Cys Lys Giu Trp 
20 25 30 

Tyr Arg.Val Thr Ser Asp- Gly Met Leu Trp 

3 5 _ 40 " . - _ . . 

<210> 2 

<211> 126 

<212> DNA 

<213> Hcmo sapiens 

<:400> 2 

ccgccagctc ggggattgga ncatattgcc gagaacattc tgtcatacct ggatgccaaa 60^ 

tcactacgcg ctgctgaact tgcgtgcaag gaatggtacc cagtcacctc cgacggcacg 120 

ctgtgg L ^ c 

<210> 3 

<211> 38 

<212> PRT 

<213> Homo sapiens v 

<400> 3 

Leu Pro Lys Glu Leu Leu Leu Arg lie Phe Ser Phe Leu Asp He Val 
1.5 10 15 

Thr Leu Cvs Arq Cvs Ala Gin He Ser Lys Ala Trp Asn He Leu Ala 
20 25 30 

Leu Asp Gly Ser Asn Trp 
3 5 
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<210> 4 

<211> 114 

<212> DNA 

<213> Homo sapiens 

<400> 4 

ttacccaaag aactcctgtt aagaatattt tccctcttgg atatagtaac tctgngccga 60 
tgtgcacaga tttccaaggc ttggaacatc ttagccccgg acggaagcaa ccgg 114 

<210> 5 

<211> 38 

<212> PRT 

<213> Homo sapiens 

<400> 5 

Leu Pro Tyr Glu Leu lie Gin Leu He Leu Asn His Leu Thr Leu Pro 
■! 5 ' .10 • 15 

Asp Leu Cys Arg Leu Ala Gin Thr Cys Lys Leu Leu Ser Gin Kis Cys 
20 25 - 30 

Cvs Aso Pro Leu Glh Tyr 
'35 



<210> 6 
<2il> 71 
<212> DNA 

<213> Home sapiens 
<400> 6 

ccaccttatig agec-attea gecgatterg aatcacctta caccaccaga cccgcgcaga e: 
ttagcacaga c 

<210> 7 

<211> 38 

<2I2> PRT 

<213> Mus musculus 

<400> 7 

Leu Pro Tvr Glu Leu He Gin Leu lie Leu Asn His Leu Ser Leu Pro 
1*5 10 .15 

Asp Leu Cvs Arc Leu Ala Gin Thr Cys Arg Leu Leu His Gin His Cys 
2C 25 . 30 

Cys Asp Pro Leu Gin Tyr 
3 5 



<210> 8 V 

<211> 114 

<212> DNA 

<213> Mus musculus 

<400> 8 

ctaccatatg agcccattca actgattctg aaccatctct caccaccaga cccgcgcaga 60 
ccagcccaga cccgcaggcc tctccaccag caccgccgcg accccccgca acaC 11 

<210> 9 

-<211>-38 

<212> PRT 



WO 99/18989 



PCT/US98/21763 



<213> Homo sapiens 
<400> 9 

Leu Pro Thr Asd Pro Leu Leu Leu 
1 " 5 

Asp Leu lie Asn Cys Cys Tyr Val 
20 

Ser His Asn Pro Leu Trp 
35 



lie Leu Ser Phe Leu Asp Tyr Arg 

10 15 

Ser Arg Arg Leu Ser Gin Leu Ser 

25 30 



<210> 10 

<211> 114 

<212> DNA 

<213> Homo sapiens 

<400> 10 

ctgcccaccg accccccgct ccccatctta tcct ttctgg actatcggga :ccaatcaac 60 
tgttgttatg ccagtcgaag acttagccag ctatcaagcc acgatccgct gcgg 114 

<210> 11 

<211> 38 

<212> PRT 

<213> Mus muscuius 

<400> 11 

Leu Pro Thr Aso Pro Leu Leu Leu lie Val Ser Phe Val Asp Tyr Arg 
1^5 10 15 

Asn Leu He Asn Cvs Cys Tyr Vai Ser Arg Ser Val Ser Gin Leu Ser 
2 0 ■ "2 5 . ■ - 3 0 

Thr His Asp Pro Leu Trp 
35 



<210> 12 

<211> 114 

<212> DNA 

<213> Mus muscuius 

<400> 12 

ccacccaccg accccctgcc cctcatagta tccctcgtgg actacaggga cccaaccaa: 60 
tgttgcxatg ctagtcgaag cgtcagccag c:accaaccc acgacccacc gcgg 114 

<210> 13 
<211> 38 

<212> PRT - - - . 

<213> Homo sapiens 

<400> 13 

Leu Pro Pro Giu Val Met Leu Ser lie Phe Ser Tyr Leu Asn Pro Gin 
15 10 15 

Glu Leu Cys Arg Cys Ser Gin Vai Ser Met Lys Trp Ser Gin Leu Thr 
20 25 30 

Lys Thr Giy Ser Leu Trp 



- 123 



WO 99/18989 



PCT/US98/21763 



<210> 14 

<211> 113 

<212> DNA 

<213> Homo sapiens 

ctccccccta aagtaatgcc gtcaattttc agctatctta atcctcaaga gttattcgat 60 
gcagtcaagt aagcatgaaa tggtctcagc tgacaaaaac gggatcgctt tgg 113 

<210> 15 

<211> 38 

<212> PRT 

<213> Mus musculus 

Leu°p-o%ro Glu Val Met Leu Ser He Phe Ser Tyr Leu Asn Pro Gin 
1 * 5 10 15 

G iu Leu Cvs Arg Cys Ser Gin Val Ser Thr Lys Trp Ser Gin Leu Ala 
20 25 30 

Lys Thr Gly Ser Leu Trp 
35 

<210> 16 

<211> 114 

<212> DNA 

<213> Mus musculus 

<400> 15 rn 
cttcctccta aggtaatgct gtccattttc agtcacccta atcctcaaga attgtgtcgg 60 
tgtagtcaag tcagtactaa gtggtcccag "ctggcaaaaa caggatcttt gtgg Ij.* 

<210> 17 
<211> 41 
<212>. PRT 

<213> Homo sapiens 
<400> 17 

Leu P-o Leu Glu Met Leu Thr Tyr He Leu Ser Phe Leu Pro Leu Ser 
1 5 10 15 

Asp Gin Lvs Giu Ala Ser Leu Val Ser Trp Ala Trp Tyr Arg Ala Ala 
20 25 30 

Gin Asn Aia Leu Arg Glu Arg Leu Trp 
35 40 



<210> 18 

<211> 123 

<212> DNA 

<213> Homo sapiens 

<400> 18 

ctgcccctcg aoatactcac atatattctg agcttcctgc ctctgtcaga tcagaaagag 60 
gcctccctcg tgagttgggc ttggtaccgt gctgcccaga atgccccccg ggagaggctg 120 
tgg 
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<210> 19 
<211> 35 
<212> PRT 

<213> Komo sapiens 



<400> 19 

Leu Pro Pro Glu Leu Ser Phe Thr 
1 5 

Asn Leu Cys Leu Ala Ser Cys Val 
.20 



lie Leu Ser Tvr Leu Asn Ala Thr 

10 * 15 

Tro Gin Aso Leu Ala Asn Asp Glu 

25 * 30 



Leu Leu Trp 
35 



<2i0> 20 

<21l3> 105 

<212> DNA 

<213> Homo sapiens 



<400> 20 
tcgcctcc 

gcttcacgcg ttcggcacga ccctgcgaat cacgaacttc tctgg 



ttgcctccug agctaagccc taccatcctg ccctacctga acgcaaccga cctntgcttg 60 



<210> 21 

<211> 35 

<212> PRT 

<213>. Mus musculus 



<400> 21 

Leu Pro Pro Glu Leu Ser Phe Thr lie Leu Ser Tyr Leu Asn Ala lie 
1-5 10 15 

Aso Leu Cys Leu Ala SV3T Cvs Val Trp Gin Asp Leu Ala Asn- Asp Glu 
20 *" 25 30 



Leu Leu Trp 
35 

<210> 22 

<21i> 105 

<212> DNA 

<213> Mus musculus 

<400> 22 

ctgccccccg agccgagccc caccacccca ccccacctgg acgcaaccga cc:ccgccta o0 
gcttcccgtg gccggcaaga acccgccaac gacgaacctc eccgg 10 

<210> 23 

<211> 38 ~ - - ~ - - . . 

<212> PRT 

<213> Mus musculus 



<400> 23 ^ 
Leu Pro Arq Val Leu Ser Val Tyr lie Phe Ser Phe Leu Asp Pro Arg 
15 10 15 

Ser Leu Cys Arg Cvs Ala Gin Val Ser Trp Tyr Trp Lys Ser Leu Ala 
20 .25 30 



Glu Leu Asp Gin Leu~Trp- 
35 
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<210> 24 

<211> 114 

<212> DNA 

<213> Mus musculus 

c^tccaagga cgttatccgc ccacatcttt tccctcccgg acccccggag cccccgccgc 60 
tgtgcacagg cgagctggta ctggaagagc tcggctgagt tggaccagct ctgg 11 

<210> 25 

<211> 38 

<212> PRT 

<213> Homo sapiens 

Leu°^o 5 Ue Asd Val Gin Leu Tyr He Leu Ser ?he Leu Ser Pre His 
1 " ' ^ 5 10 15 

Asn Leu Cys Gin Leu Gly Ser Thr Asn Kis Tyr Trp Asn Glu Thr Val 
20 25 30 

Arc His Pro He Leu * Trp 
35 



<210> 26 

<211> 114 

<212> DNA 

<213> Homo' sapiens 



<400> 26 ^ A 

ctgccgaccg acgcacagcc atacacttcg tcctccctcc caccccacga cccgtgccag 60 
ttgggaagca caaaccacca ccggaacgaa accgcaagac acccaacccc c^gg j.14 



<210> 27 

<211> 40 

<212> PRT 

<213> Homo sapiens 



<400> 27 

Leu Pro Leu Glu Leu Trp Arg Mec lie Leu Ala Tyr Leu His Leu Pro 

- i a ^ ^ 



Asp Leu GIv Arg Cvs Ser Leu Val Cys Arg Ala Trp Tyr Giu Leu lie 
. * 20 * 25 30 

Leu Ser Leu Asp Ser Thr Arg Trp 
3 5 - 40 

<210> 28 
<2il> 120 
<212> DNA 

. <213> Homo sapiens 

<400> 28 £n 
ctccccttga agctgtggcg catgatctta gcctacctgc accttcccga cctgggccgc 60 
cgcagcctgg catgeaggge ctggtatgaa ccgaccccca gcctcgacag cacccgccgg 120 

_<21 0> 29 

<2ii> 33 : : 

<212> PRT 
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<213> Mus musculus 

Leu°P-o 9 Ala Glu He Thr Phe Lys He Phe Ser Gin Leu Asp lie Arg 
1 5 10 lo 

Ser Leu Cys Arg Ala Ser Leu Thr Cys Arg Ser Trp Asn Asp Phe Lys 
20 25 30 

Ser 

<210> 30 

<211> 90 

<212> DNA 

<213> Mus musculus 

ctgcctgcag aaatcacttt taaaattrtc agtcagctgg acattcggag tctgcgcagg 60 
gcctcattga catgcaggag cnggaatgac . 

<210> 31 

<211> 38 

<212> PRT 

<213> Mus musculus 

Leu°^o X Leu Leu Gin Gin Pro Leu Leu Cys Ser Val Ala His Pro He 
1 ■ 5 10 15 

Ala Ser Phe Thr Met Leu Ser Tyr Leu Thr Gly Lys Glu Ala Ala His 
20 25 • 30 

Leu Ser Val Glu Leu Trp ..*"■_. 
35 ■ " - . - . 



<210> 32 

<211=> 114 

<212> DNA 

<213> Mus musculus 

<400> 32 

ctcccaicac tgcagcagcc acccccgtcc tccgcggccc accccaccgc cagcctcac 
acgccgtcat accccacggg aaaggaggcc gcccacctgc cagcggagtc gtgg 

<210> 33 
<211> 38 
<212> PRT 

<213> Mus musculus 
<400> 33 

Leu Pro Asp Ser Leu Val Tyr Gin He Phe Leu Ser Leu Gly Pro Ala 
1 5 .10 15 

Asp Val Leu Ala Ala Gly Leu Val Cys Arg Gin Trp Gin Ala Val Ser 
20 .25 30 

Arg Asp Glu Phe Leu Trp 
35 
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<210> 34 

<21i> 114 

<212> DNA 

<213> Mus musculus 



ct^ccgaca occttgtcta ccagatcttc ctgagtttgg gccctgcaga tgtgctggct 6 0 
gctgggltgg tatgccgcca atggcaggct gcgccccggg acgagttcct acgg 1x4 



<210> 35 

<211> 31 

<212> PRT 

<213> Mus musculus 



r!eu°Pro 5 Glu Glu Val Leu Ala Leu lie Phe Arg Asp Leu Pro Leu Arc 



<400> 35 
»u 

1 

Asd Leu 



5 



10 



Ala Val Ala Thr Arg Val Cys 



Arc Ala Trp Ala Ala Ala 



20 



25 30 



<210> 36 

<211> 93 

<212> DNA 

<213> Mus musculus 

ccgccagagg aagtgccggc actcacctcc cgcgacccgc cc«caggga ccccgccgta 60 
gccaccagag tctgcagggc ccgggcggcg get 

<210> 37 

<211> '38 

<212> PRT 

<213> Mus musculus 

Leu°P-o 7 Ser Val Pro Met Met' Glu He Leu Ser Tyr Leu. As? Ala Tyr 
1 5 10 15 

Ser Leu Leu Gin Ala Ala Gin Val Asn Lys Asn Trp Asn Glu Leu Ala 
20 

Ser Ser Asn Val Leu Trp 
35 



25 30 



<210> 38 
<211> 114 

<212> DNA " " - • - . _ 

<213> Mus musculus 

ttacctagtg tgecgatgat ggaaatcccc tcctatctgg atgcctacag cctgccacag 60 
gctgcccaag tgaacaagaa ctggaatgaa ettgeaagea gcgatgccct gtgg 11. 

<210> 39 

<211> 38 

<212> PRT 

„<213j> Mus musculus 
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Me?°?ro 9 Ser Glu He Leu Val Lys He Leu Ser Tyr Leu Asp Ala Val 
1 



5 10 15 



Thr Leu Val Cys He Gly Cys Val Ser Arg Arg Phe Tyr His Leu Ala 

25 30 



20 

Asp Aso Asn Leu He Trp 
. 35 

<210> 40 

<211> 114 

<212> DNA 

<213> Mus mus cuius 

aJgccaccca aaaccttggt gaagatactt tcctacctgg atgcggcgac cccggcgcgc 60 
a ^ ggacg r- cg agcagacg cttccatcat ttggccgatg acaatccca: utgg 1x4 

<210> 41 
<211> 43 
<212> PRT 

<213> Homo sapiens 

Leu^ro^ec Glu Val Leu Met Tyr lie Phe Arg Trp Val Val Ser Ser 
1 '5 10 15 

Asd L^u Asd Leu Arg Ser Leu Glu Gin Leu -Ser Leu Vai Cys Arg Gly 
' ~ " 20 25 30 

Phe Tvr He Cvs Ala Arg Asp Pro Glu lie Trp 

* " 35 ■ 40 " ... 

<210> 42 

<211> 129 

<212> DNA 

<213> Homo sapiens 

ccgccaatgc acgccctcat otacatcccc cgatgggtgg cgtccagcga cttggacctc 60 
agatcaccgg agcagttgcc gccggtgcgc agagggctct acacccgcgc cagagacccC i20 
gaaacacgg 

<2±Q> 43 
<211> 18 
<212> PRT 

<213> Mus mus cuius 

Leu°Ser 3 Leu Val Cvs Arg Gly Phe Tyr He Cys Ala Arg Asp Pro Glu 
1 *5 10 15 

He Trp 
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<210> 44 
c211> 81 
<212> DNA 
<2I3> Mus mus cuius 

<400> 44 

gactcggacc tcagaccgrc agagcagttg tcactggtgt gcagaggatt ccatatctgt oO 
gccagagacc ccgaaatctg g 51 

<210> 45 

<211> 31 

<212> PRT 

<213> Kcmo sapiens 

<400> 45 

Leu Fro Tyr Glu Leu Ala lie Asn He Phe Xaa Tyr Leu Asp Arg Lys 
1 5 10 15 

Glu Leu Glv Arg Cys- Ala Gin Val Ser Lys Thr Trp Glu Gly Asp 
20 .25 30 



<210> 45 

<211> 93 

<212> DNA 

<213> Homo sapiens 

<400> 46 

ccgccttacg aactggcaat caacacattt agtacctgga caggaaagaa ccaggaagac 60 
gcgcacaggn gagcaagacg tgggaaggcg ate 53 

<210> 47 

<211> 38 

<212^ PRT 

<213> Komo sapiens 

<400> 47 

Leu Pro Leu Glu Leu Lvs Leu Arg He Phe Arg Leu Leu Asp Val Arg 
1 • 5 10 IS 

Ser Val Leu Ser Leu Ser Ala Val Cys Arg Asp Leu Phe Thr Ala Ser 
20 25 30 

Asn Asp Pro Leu Leu Trp 
35 



<210> 48 
<211> 114 

<212> DNA • . 

<213> Homo sapiens 

<400> 48 

ctcccattgg aactgaaact aeggatctte cgacttccgg atgctcgctc cgtcttgtct 60 
tcgtccgcgg tctgccgcga cccctttacc gctccaaacg acccacccct gtgg H 

<210> 49 

<211> 38 

<212> PRT 

<213> Mus musculus 
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<400> 49 

Leu Pro Leu Glu Leu Lys Leu Arg 
i . 5 

Ser Val Leu Ala Leu Ser Ala Val 
20 



lie Phe Arg Leu Leu Asp Val His 
10 15 

Cys His Aso Leu Leu lie Ala Ser 
25 * 30 



Asn Asd Pro Leu Leu Trp 
35 

<210> .50 

<211> 114 

<212> DNA 

<213> Mus musculus 

<400> 50 

cctccactgg agctgaaact acgcatcttc cgacttctgg atgttcattc tgtcctggcc 60 
ctgtctgcag tctgtcatga cctcctcact gcgccaaacg acccactgcc gtgg 114 

<210> 51 
<211> 456 
<212> PRT 

<213> Homo sapiens 
<400> 51 

Ser Ala Met Val Phe Ser Asn Asn Asp Giu Gly Leu He Asn Lys Lys 
1 5 ' 10 15 

Leu Pro Lys Glu Leu- Leu Leu Arg He Phe Ser Phe Leu Asp lie Val 
20 25 30 

Thr Leu Cvs Arg Cys Ala Gin lie 'Ser Lys Ala Trp 'Asn He Leu Ala 
35 40 -~.\ 45. 

Leu Asd Gly Ser Asn Trp Gin Arg lie Asp Leu Phe Asn Phe Gin He . 
50 55 60 

Asp Val Glu Gly Arg Val Val Glu Asn He Ser Lys Arg Cys Gly Gly 
65 70 75 80 

Phe Leu Arg Lys Leu Ser Leu Arg Gly Cys He Gly Val Gly Asp Ser 
85 90 95 

Ser Leu Lys Thr Phe Ala Gin Asn Cys Arg Asn He Glu His Leu Asn 
100 105 110 

Leu Asn Glv Cys Thr Lys- He Thr Asp Ser Thr Cys Tyr Ser Leu Ser 
115 120 " 125 

Arg Phe Cys Ser Lys Leu Lys His Leu Asp Leu Thr~Ser Cys Val Ser~ 
130 135 140 

He Thr Asn Ser Ser Leu Lys Gly He Ser Glu Gly Cys Arg Asn Leu 
145 150' 155 160 

Glu Tyr Leu Asn Leu Ser Trr> Cvs Ast> Gin He Thr Lys Asp Gly He 
165 " ' 170 175 

Glu Ala Leu Val Arg Gly Cys Arg Gly Leu Lys Ala Leu Leu Leu Arg 
180 IS 5 19 0 
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Glv Cvs Thr Gin Leu Glu Asp Glu Ala Leu Lys Kis He Gin Asn Tyr 
1 1 195 200 205 

Cvs H^s Glu Leu Val Ser Leu Asn Leu Gin Ser Cys Ser Arg He Thr 
210 215 220 

Asn Glu Gly Val Val Gin He Cys Arg Gly Cys His Arg Leu Gin Ala 
225 230 235 240 

Leu Cvs Leu Ser Gly Cys Ser Asn Leu Thr Asp Ala Ser Leu Thr Ala 
245 250 255 

Leu Glv Leu Asn Cys Pro Arg Leu Gin He -Leu Glu Ala Ala Arg Cys 
260 265 270 

Ser His Leu Thr Asp Ala Gly Phe Thr Leu Leu Ala Arg Asn Cys His 
275 280 235 

Glu Leu Glu Lys Met Aso Leu Glu Glu Cys He Leu He Thr Asp Ser 
290 295 300 

Th- Leu He Gin Leu Ser He His Cys Pro Lys Leu;Gin Ala Leu Ser 
305 310 315 320 

Leu S^y s Cys Glu Leu He Thr Asp Asp Gly lie Leu His Leu Ser 
325 330 335 

Asn Se^- Th*- Cvs Gly His Glu Arg Leu Arg Val Leu Glu Leu Asp Asn 
340 345 350 

Cys Leu Leu He Thr Aso Val Ala Leu Glu His Leu Glu Thr Ala Glu 
355 " 360 365 

Ala Tip Ser Ala Ser Ser Cys ' Thr -Thr' Ala Ser Arg Leu Pro Val Gin 
370 375 380 

Ala Ser Ser Gly Cys Glv Leu Ser Ser Leu Met Ser Lys Ser Thr Pro 
385 390 395 400 

Thr L<=u Leu Pro Ser Pro His Arg Gin Gin Trp Gin Glu Val Asp Ser 
405 410 415 

Asp Cvs Ala Gly Ala Val Ser Phe Ser Asp Ser Ser Cys Leu Gly Pro 
' " 420 425 430 

Arq Gly Aso Glu Ala Ser Phe Pro Leu Glu Asp Leu Ser Leu Pro Asp 
435 440 445 

Arg Leu His His His Pro He Cys 
4 50 4 55 



<210> 52 

<2ll> 1230 

<212> DNA 

<213> Homo sapiens 



<400> 52 _ 0 
ttcgaccatg ottttctcaa acaatgatga aggccttatt aacaaaaagt tacccaaaga 60 
acttcracta agaatacttt ccttcttgga tatagtaact ttgtgccgat gtgcacacat 120 
- t-t ccaaggGt-tggaacacc.t_tag.ccc tgga^ jcggaagcaac_ tg gcaaag aa ta gatct ttt 18 0 
taactttcaa atagatgtag agggtcgagt ggtggaaaat atctcgaagc gatgcggtgg 240 
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attcctgagg 
ctttgcacag 
tgacagcacg 
ctcctgtgtg 
ggagtacctg 
gcgaggttgt 
agctctgaaa 
ctcacgtatc 
tctctgcctt 
ctgtccgcga 
tacaccttta 
gataaccgac 
ccrgtcccac 
tggccatgag 
cctggaacac 
gttacccgcg 
tactttgctc 



aagctcagct 
aaccgccgaa 
tgctacagcc 
tctattacaa 
aacctctct c 
cgaggcctga 
cacatccaga 
acggatgaag 
tcgggttgca 
ctgcaaattt 
gctcggaatt 
agcacactca 
tgtgaactca 
aggctgcggg 
ctagaaacrg 
caggcatcaa 
ccgtcacccc 



tgcgaggccg 
acatcgaaca 
t tagcagatt 
acagctcct t 
ggtgtgatca 
aagccccgct 
attactgcca 
gtgtggtgca 
gcaaccccac 
tggaggctgc 
gccacgaatc 
cccagctccc 
tcacagatga 
tactggagtt 
ccgaggcctg 
gcggargcgg 
accgacagca 



cattggtgtt 
ttngaacctc 
ctgttccaag 
gaaggggatc 
gaccacgaag 
cctgaggggc 
tgagcttgtg 
gatatgcagg 
agatgcctct 
ccgatgctcc 
ggagaagatg 
cattcactgt 
tgggatcctg 
ggacaactgc 
gagcgcctcg 
gctcagcucc 



ggggattcct 
aatggatgca 
ctgaaacatc 
agtgagggct 
gatggcatcg 
tgcacacagt 
agcctcaact 
ggctgtcacc 
cttacagccc 
catttgactg 
gatcttgaag 
cctaaactgc 
cacccgagca 
ctcctcacca 
agctgtacga 
c:ca£ctcaa 



ccttcaagac 
caaaaatcac 
tggatccgac 
gccgaaaccc 
aggcactggt 
cacaagacga 
tgcagtcctg 
ggctacaggc 
tgggtttgaa 
acgcaggttt 
aacgcacccc 
aagccctgag 
acagtacctg 
ctgatgtggc 
ccgccagcag 
agtccacgcc 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1230 



<210> 53 

<211> 380 

<212> PRT 

<213> Homo sapiens 



<400> 53 
Arg Pro Arg 
1 

Lys Asp Gly 



Val Leu Gly 
3 5 

Glu Leu lie 
50 

Arg Leu Ala 
65 . 

Leu Gin Tyr 
A so Thr Ser 



Leu Asn Leu 
■115 

Phe Ser Arg 
130 

Leu Ser Cys 
145 



Phe Gly Thr Ser Asp 



Cys 
20 

Glu 



Gin 
Gin 
He 



Leu 
100 

Ser 



Phe 



Ser 



Gly Met Asp Ser 



Gly Pro Asn Asn 
• 40 

Leu lie Leu Asn 
55 

Thr Cys Lys Leu 
70 

His Leu Asn Leu 
85 

Glu Phe Leu Gin 



Glu Met Cvs Pro 



Leu Pro Pro 



Gin 
180 



Tro Thr Gly Asn 
120 

Leu Lys Val Cys 
135 

His Phe Leu Asn ■ 
150 

Asn Leu Gin Ala 
165 

Ala Phe Asn His 



lie Glu Asp Asp 

io' 

Leu Asn Lys Lys 
25 

Gly Tyr Phe Asp 



His Leu Thr Leu 
60 

Leu Ser Gin His 
75 

Gin Pro Tyr Trp 
90 

Ser Arg Cys Thr 
105 

Arg Giy Phe He 



Gly Ser Glu Leu 
- . 14.0 

Glu Thr Cys Leu 
155 

Leu Asn Leu Ser 
170 



Ala Tyr Ala Glu 
15 

Phe Ser Ser Ala 
30 

Lys . Leu Pro Tyr 
45 

Pro Asp -Leu Cys 



Cys Cys Asp Pro 
80 

Ala Lys Leu Asp 
95 

Leu Val . Gin Trp 
110 

Ser Val Ala Gly 
125 

Val Arg Leu Glu 



Glu Val He Ser 
160 

Ser Cys Asp Lys 
175 



He Ala Lys Leu 
185 



Cys Ser Leu Lys 
190 



Arg Leu Val 
195 



Leu 



Tyr Arg Thr Lys - 
200 



-Va-i— Giu— G-l-n-T-h-r— Ala— Leu-.Leu_S.er_ 
205 
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lie Leu Asn Phe Cys Ser GIu Leu Gin His Leu Ser Leu Gly Ser Cys 
210 215 220 

Val Met He Glu Asn Tyr Asp Val He Ala Ser Met He Gly Ala Lys 
225 " 230 23S. 240 

Cvs Lvs Lys Leu Arg Thr Leu Asp Leu Trp Arg Cys Lys Asn He Thr 
245 250 255 

Glu Asn Gly He Ala Glu Leu Ala Ser Gly Cys Pro Leu Leu Glu Glu • 
260 265 270 

Leu Asd Leu Gly Trp Cys Pro Thr Leu Gin -Ser Ser Thr Gly Cys Phe 
" 275 280 285 

Thr Arg Leu Ala Kis Gin Leu Pro Asn Leu Gin Lys Leu Phe Leu Thr 
290 295 300 

Ala Asn Arg Ser Val Cvs Aso Thr Asp He Asp Glu Leu Ala Cys Asn 
305 310 " 315 320 

Cys Th^- Arg Leu Gin Gin Leu Asp He Leu Gly Lys Val Thr He Tyr 
325 330 335 

Lys Phe Val Leu Asn Val Cvs Phe Leu Asp Arg Lys Ala Asn Leu Arg 
340 345 350 

Leu Phe Val Arg Lys Lys Lys He Phe Gly Tyr Asn Lys Asn Phe lie 
355 360 365 

Leu He Arc: Tro Leu Gly Leu lie Gly Asn Ala Arg 
370 " '* 375 3S0 



<210> 54 
<211> 1380 
<212> DNA 

<213> Homo sapiens 
<400> 54 

aggccaagat tcggcacgag tgatatagaa gatgatgcct acgcagaaaa ggatggttgc 60 

ggaatggaca gtcttaacaa aaagutcagc agtgctgtcc tcggggaagg gccaaataat 120 

gggtatttcg ataaactacc tcatgagctt attcagctga tcctgaatca tcccacacta 180 

ccagacccgc gtagactagc acagacccgc aaactactga gccagcattg ccgtgatcc: 240 

ctgcaataca cccacctcaa cccgcaacca tactgggcaa aactagacga cacctctecg 300 

gaatttctac agtctcgctg cactctcgtc cagtggctta atttatcttg gactggcaat 360 

agaggcttca tctccgttgc aggacccagc aggcccctga aggtttgtgg acccgaacca 420 

gtacgccttg aactgtcttg cagccac'ttt cttaatgaaa cttgcttaga age tact tec 480 

gagacgtgtc caaatctaca ggcctcaaat ctctcctcct gegataaget accacctcaa 540 

gctttcaacc acattgecaa gt tatgeagc. ct.caaacgac ccg.cccccca tcgaacaaaa 600 

gtagagcaaa cagcaccgct cagcatttcg aacttctgtt cagagcttca gcacctcagc 660 

ttaggcagtt gtgtcacgat tgaagactac gatgegatag ctagcacgat aggagecaag 720 

cgcaaaaaac tccggacccc ggacccgcgg agatgeaaga acatcactga gaatggaaca 780 

geagaacegg ctcctgggcg cccaccaccg gaggagcttg accctggccg gtgcccaacc 840 

ctgeagagea gcaccgggcg cttcaccaga ctggcacacc agctcccaaa ettgeaaaaa 900 

ctctttctta cagctaacag aCctgtgtgc gacacagaca ccgatgaatc ggcaegtaac 960 

tgcaccaggt cacaacagcc ggacatatta ggtaaggtta caatatataa atctgttcca 1020 

aatgtctgtc tccttgacag aaaagccaat ctcagacttt ctgttaggaa aaagaaaacc 1080 

Ctcggataca ataaaaacct tatcctgaca agaeggcttg gcccgacagg aaacgecaga 1140 



- 134 - 



WO 99/18989 PCT/US 98/2 1763 

tagaccagtt aatataggga ataattatat atgcacttta ataaaacagt gaggacaata 1-200 

acaactttat agttgaactg taaaaaacca caaccattaa ttcttggtct acttgtaaga 1260 

gtgagaaft acatgagctg cactctctat: ttttattaag gagagaagaa attaattcat 1320 

ttgtataatg aatccaagct agtttttttc aagtttctta attaagcggc cgcaagctta 1380 

<210> 55 
<211> 519 
<212> PRT 

<213> Homo sapiens 

<400> 55 „ . 

Met Val He Met Leu Glx Glu Arg Gin Lys Phe Phe Lys Tyr Ser Val 

* 10 15 



Asp Glu Lys Ser Asp Lvs Glu Ala Glu Val Ser Glu His Ser Thr Gl} 
20- * * 25 30 

He Thr His Leu Pro Pro Glu Val Met Leu Ser He Phe Ser Tyr Leu 
35 ' 40 45 

Asn Pro G"» n Glu Leu Cys Arg Cvs Ser Gin Val Ser Met Lys Trp Ser 
50 55 60 

Gin Leu Thr Lvs Thr Glv Ser Leu Trp Lys His Leu Tyr Pro Val His 
65 * 70 75 80 

T rD a 1 a Arq Glv Asp Tro Tyr Ser Glv Pro Ala Thr Glu Leu Asp Thr 
85 " ' ' .90 95 

Glu Pro Asp Asd Glu Trp Val Lys Asn Arg Lys Asp Glu Ser Arg Ala 
100 105 110 

Phe His Glu Trp Asp Glu Asp Ala -Asp lie Asp Glu Ser Glu Glu Ser 
115 — " " 120 - - 125 

Ala Glu Glu Ser He Ala He Ser He Ala Gin Met Glu Lys Arg Leu 
130 135 140 

Leu His Gly Leu He His Asn Val Leu Pro Tyr Val Gly' Thr Ser Val 
145 150 155 ISO 

Lys Thr Leu Val Leu Ala Tyr Ser Ser Ala Val Ser Ser Lys Met Val 
165 170 175 

Arg Gin lie Leu Glu Leu Cvs Pro Asn Leu Glu His Leu Asp Leu Thr 
180 * 135 190 

Gin Thr Asp He Ser Asd- Ser Ala Phe Asp Ser Trp Ser Trp Leu Gly 
195 * 200 205 

Cys Cvs Gin Ser Leu Arg His Leu Asp Leu Ser Gly Cys ~Glu" Lys He 
210 215 220 

Thr Asp Val Ala Leu Glu Lys He Ser Arg Ala Leu Gly He Leu Thr 
225 ' 230 235 240 

Ser His Gin Ser Gly Phe Leu Lys Thr Ser ' Thr Ser Lys He Thr Ser 
245 250 255 

Thr Ala Trp Lvs Asn Lvs Asp He Thr Met Gin Ser Thr Lys Gin Tyr 

2 60 ! 26 5 : 270 
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Ala Cys Leu His Asd Leu Thr Asn Lys Gly He Gly Glu Glu He Asp 
275 " 280 285 

Asn Glu His Pro Trp Thr Lys Pro Val Ser Ser Glu Asn Phe Thr Ser 
290 295 300 

Pro Tyr Val Trp Met Leu Asd Ala Glu Asp Leu Ala Asp He Glu Asp 
305 310 315 320 

Thr Val Glu Tro Arg His Arg Asn Val Glu Ser Leu Cys Val Met Glu 
325 330 33-5 

Thr Ala Ser Asn Phe Ser Cys Ser Thr Ser Gly Cys Phe Ser Lys Asp 
340 345 350 

He Val Gly Leu 'Arg Thr Ser Val Cys Trp Gin Gin His Cys Ala Ser 
3-55 360 365 

P-o Ala Phe Ala Tyr Cvs Giv His Ser Phe Cys Cys Thr Gly Thr Ala 
370 \ 375 380 

L AT - a Thr M ^ t ser Ser Leu Pro Glu Ser Ser Ala Met Cys Arg Lys 
335 " 390 395 400 

Ala Ala Arc Thr Arg Leu Pro Arg Gly Lys Asp Leu He Tyr Phe Gly 
405 410 415 

Se- GTu Lvs Ser Asp Gin Glu Thr Gly Arg Val Leu Leu Phe Leu Ser 
' 420 425 430 

Leu Ser Gly Cvs Tyr Gin He Thr Asp His Gly Leu Arg Val Leu Thr 
435 "* 440 445 

Leu Gly Gly Gly Leu Tro Tyr Leu; Glu His -Leu Asn Leu Ser Gly Cys 
450 * 455 460 

' Leu Thr ll» Thr Gly Ala Glv Leu Gin Asp Leu Val Ser Ala Cys Pro 
.465 * " 470 * 475 480 

Ser Leu Asn Asp Glu Tvr Phe Tyr Tyr Cys Asp Asn He Asn Gly Pro 
485 ' 490 495 

His Ala Asp Thr Ala Ser Glv Cys Gin Asn Leu Gin Cys Gly Phe Arg 
500 505 510 

Ala Cys Cys Arg Ser Gly Glu 
515 



<210> 56 
<211> 2276 
<212> DNA 

<212> Homo sapiens 



atggtaatca tgctgtaaga gcgacagaaa ttctttaaat autccgtgga tgaaaagtca 60 . 

aataaagaag cacaaatgtc agaacacccc acaggtacaa cccatcttcc tcctgaggta 120 

atgctgtcaa tctccagcta tcttaaccct caagagttat gtcgatgcag ccaagtaagc 180 

atgaaatggt ctcagctgac aaaaacggga tcgctttgga aacatcttta ccctgttcat 240 

tgggccagag gcoactggta tiagtggtccc gcaactgaac ttgatactga acctgatgat 300 
oaatcQgcga'aaaatagbTa^ 

gacattgatg aatctgaaga gtctgcggag gaatcaatcg ctatcagcat cgcacaaacg 420 
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gaaaaacgtt 

aaaacctcag 

gagctttctc 

tttgacagtt 

tgtgagaaaa 

tctcatcaaa 

aataaagaca 

aagggcattg 

aatttcactt 

actgtggaat 

tntagttgtt 

tgttggcagc 

acaggaacag 

gcagcaagga 

gatcaagaga 

gaccatggtc 

ctcrctggtt 

tctctgaatig 

gccagtggat 

cccttgactt 

tactcatagc 

atttctcaca 

aattatacrt 

atttttcttc 

ttcatagata 

cactgtacca 

acagcacacc 

cagaaggcca 

cttctgagtg 

tactttgttc 

acaatcacct 



tactccatgg 
tattagcata 
ctaacctgga 
qctcttggct 
tcacagatgt 
gtggcttttt 
ttaccatgca 
gagaagaaat 
ctccttatgt 
ggagacatag 
ccacctctgg 
agcattgcgc 
ctttaagaac 
ctagattgcc 
ctggacgtgt 
tcagggtttt 
gtcttactat 
atgaatactt 
gccagaattt 
ccgatctttg 
acatttcttg 
actccagaaa 
tcggt.tt.aga 
taaatgaaat 
attttgagtt 
agctttgcag 
tttggaaacc 
at tcaaacaa 
ttagctcagt 
tcaccactgt 
gaagaaa tgt 



cttaattcat 

cagctctgca 

gcatctggat 

tggttgctgc 

ggccctagag 

gaaaacatct 

gtccaccaag 

agataatgaa 

gtggatgtta 

aaatgttgaa 

ttgttttagt 

ttctccagcc 

tatgtcatca 

taggggaaaa 

acttctgttt 

gactctggga 

aactggtgca 

ttactactgt 

gcagtgtggt 

tctacttcat 

tgttaaccat 

tcttaattta 

aagggattag 

get t taaaga 

ttaaagtcca 

atctttcccg 

aatctctttt 

acccaca 1 1 1 

gagtaggaaa 

ccctatgtca 

ataaataaaa 



aacgttctac 

gt ctccagca 

cttacccaga 

cagagtcttc 

aagatttcca 

acaagcaaaa 

cagtatgect 

cacccctgga 

ga tgctgaag 

agtctttgtg 

aaggacattg 

tttgegtatt 

ctcccagaat 

gacttaattt 

ctcagtttat 

ggagggctgc 

ggectgeagg 

gacaacatta 

tttcgagect 

ttagctgagc 

ccctttttga 

ccagtgaa tt 

gtcttttcaa 

atgt t'ggtaa 

tggaggtgat 

acacacatgt 

cegtaatatt 

aaggttcttt 

gcaccaatcg 

tcaaatttgg 

tctactttga 



catatgttgg 
aaatggttag 
ctgacatttc 
ggcatcttga 
gagctcttgg 
ttacttcaac 
gtttgeaega 
etaagectgt 
atttggctga 
taatggaaac 
ttggactaag 
gtggtcactc 
ettctgeaat 
actttgggag 
ctggatgtta 
cttatttgga 
atttggtttc 
acggtcctca 
cctgccgccc 
aggctttctt 
gcgtgacttg 
gcaatgttgt 
aagggtgaga 
tgccatgtca 
tggttctctt 
ccgaagac tt 
tcctttat tt 
aggattatag 
atatttgtct 
gagagatt tt 
ggactttacc 



tacttctgta 
gcagatttta 
agattctgea 
tctgtctggt 
aattctgaca 
tgcgtggaaa 
tttaactaac 
ttcttctgag 
tattgaagat 
agcatccaac 
gactagtgtc 
attttgttgt 
gtgtagaaaa 
tgaaaaatct 
tcagatcaca 
gcaccttaat 
agcatgtcct 
tgetgatace 
tggcgaatga 
teatgeact: 
ttttggcccc 
ttctcttgea 
. acagtcttac 
t ttaaagtac 
tacacattaa 
attttcaaag 
caatgattct 
aataaattgg 
cctttaggga 
t taaaatacc 
aagtaa 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1360 
■ 1S20 
1980 
2040 
2100 
2160 
2220 
2276 



<210> -57 

<2ii> 39 : "■ 

<212> FRT 

<213> Homo sapiens 

<400> 57 

r. eu p ro Leu Glu Leu Ser Phe Tyr Leu Leu Lys Trp Leu Asp Pro Gin 
15 10 15 

Thr Leu Leu Thr Cys Cys Leu Val Ser Lys Gin Trp Asn Lys Val lie 
20 25 30 



Ser Ala Cvs Thr Glu Val Trp 
35 



<210> 58 

<211> 117 . __ . _ 

<212> DNA 

<213> Homo sapiens 

<400> 58 

cttcccctgg agctcagttt ttatttgtta aaatggctcg atcctcagac tttactcaca 60 
tgctgcctcg tctctaaaca gtggaataag gtgataagtg cctgtacaga ggtgtgg 11 

<210> 59 
<211> 10 
<212> DNA 

— <21\3 > ~Ar z -i-f i c i-a 1— S e quence : 
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<220> . , , ^ . 

<223> Description of Artificial Se^ence: Synthetic 

<400> 59 
aattcgcgcg 

<210> 60 

<211> 21 

<212> PRT 

<213> Artificial Sequence 

<220> • * 

<223> Description of Artificial Sequence: Synthetic 

<400> 60 ^ 
Lvs Lvs Giu Arg Leu Leu Asd Aso Arg His Asp Ser Gly Leu Asp Ser 
1 5 10 15 

Met Lys Asd Glu Glu 
20 
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CLAIMS 

What is claimed is: 

1. An isolated polypeptide comprising at least one functionally active fragment of 
an F-box protein. 

2. The isolated polypeptide of Claim 1. wherein said F-box protein is mammalian 
F-box protein. 

3. The isolated- polypeptide of Claim 1. wherein said functionally active fragment 
consists of the amino acid sequence selected from the amino acid sequences set forth in SEQ 
ID NOS:I. 3 9 5. 9, 13, 17. 19, 25. 27, 4L 45 ; 47, 51, 53, 55, and 57. 

4. The isolated polypeptide of Claim 1, wherein said functionally active fragment 
consists of the amino acid sequence selected from the amino acid sequences set forth in SEQ 
ID NOS:7. 11. 15. 21, 23. 29, 31 ; 33. 35, 37, 39, 43, and 49. - 

5. A purified antibody which binds to the isolated polypeptide of Claim 1. 

6. An isolated* nucleotide sequence encoding at least one functionally active 
fragment of an F-box protein. 

7. The isolated nucleotide sequence of Claim 6, wherein said F-box protein is a 
mammalian F-box protein. 
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8. The isolated nucleotide sequence of Claim 6, wherein said sequence consists of 
at least a portion of the sequence set forth in SEQ ID NOS:2, 4, 6. 10. 14, 18. 20, 26. 28, 42, 
48, 52, 54. 56, and 58. 

9. The isolated nucleotide sequence of Claim 6, wherein said sequence consists of 
at least a portion of the sequence set forth in SEQ ID NO:8, 12, 16, 22, 24, 30, 32 ? 34, 36, 
38. 40, 44. and 50. 

10. A vector comprising a nucleotide sequence, wherein the nucleotide sequence 
comprises the nucleotide sequence of Claim 8. 

11. A vector comprising a nucleotide sequence, wherein the nucleotide sequence 
comprises the nucleotide sequence of Claim 9. 

12. A host cell transformed with the vector of Claim 10. 

13. A host cell transformed with the vector of Claim 1 1. 

14. A first polynucleotide sequence comprising at least fifteen nucleotides, which 
hybridizes under stringent conditions to at least a portion of a second polynucleotide sequence, 
wherein said second polynucleotide sequence is selected from the polynucleotide sequences set 
forth in Claim 8. - - . _ . . __ 
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15. A method for detection of polynucleotides encoding F-box protein in a 
biological sample comprising the step of hybridizing at least a portion of a polynucleotide 
encoding an F-box protein to nucleic acid of a biological sample ; thereby forming a 
hybridization complex. 

16. The method of Claim 15. further comprising the step of detecting said 
hybridization complex, wherein the presence of said complex correlates with the presence of a 
polynucleotide encoding F-box protein in said biological sample. 

17. The method of Claim 15. wherein before hybridization, said nucleic acid of 
said biological sample is amplified by the polymerase chain reaction. 

18. A method for the detection of F-box protein targets comprising the steps of: 

a) providing an F-box prbtei-n._ and a sample suspected of containing an F-box 
protein target; and 

b) exposing said F-box protein to said sample under conditions such that said F- 
box protein binds to said F-box protein target to form an F-box/target complex. 

19. The method of Claim 18. further comprising the step of detecting said F-box 
protein/target complex. 

- 20. The method of Claim 17 ? wherein said F-box protein target is selected from the 
group consisting of cyclins, cyclin-dependent kinases, and IkB. 

21. The method of Claim 17 ? further comprising the step of observing said F-box 
protein/target complex for degradation of said F-box protein target. 

22. The method of Claim 17 ? further comprising the step of exposing said F'-box 
protein and F-box protein target to an F-box protein antagonist. 
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23. The method of Claim 20. wherein said F-box protein antagonist prevents the 
formation of said F-box protein and said target complex. 

24. A method for the detection of an F-box protein and Skpl complex, comprising 

5 the steps of: 

a) providing an F-box protein, and Skpl; and 

b) exposing said F-box protein to said Skpl under conditions such that said F-box 
protein binds to said Skpl to form an F-box protein/Skpl complex. 

10 25. The method of Claim 24. further comprising the step of detecting said F-box 

protein/Skpl complex. _ . 

26. The method of Claim 24, wherein step b) further comprises exposing said F- 
box protein and said Skpl to an F-box protein. antagonist; 

15 

27. The method of Claim 24 ; wherein said F-box protein antagonist prevents the 
formation of said F-box protein/Skpl complex. 

28. A method for the detection of one or more NF-kB regulatory factors 
20 comprising the steps of: 

a) providing a slimb protein, and a sample suspected of containing one or more 
- - - NF-kB regulatory factors; and 

b) exposing said slimb protein to said sample under conditions such that said 
slimb protein binds to said one or more NF-kB regulatory factors to form a 

25 slimb/reguiatory factor complex. 

29. The method of Claim 28, further comprising the step of detecting said 



slimb/regulatory factor complex. 
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30. The method of Claim 28. further comprising the step of observing said 
slimb/resulatory factor complex for degradation of said one or more NF-kB regulatory factors. 

31. The method of Claim 28. further comprising the step of exposing said slimb 
5 protein and one or more NF-kB regulatory factors to an F-box protein antagonist. 

32. The method of Claim 31. wherein said F-box protein antagonist prevents the 
formation of said slimb/regulatory factor complex. 

10 33. A method for the detection of a slimb protein complex, comprising the steps of: 

a) providing a slimb protein and a sample suspected of containing one or more 
proteins capable of forming a complex wich said slimb protein; and 

b) exposing said slimb protein to said one or more proteins capable of forming a 
complex _with said slimb protein .under conditions such that said slimb protein 

15 binds to said one or more proteins capable of forming a complex with said 

slimb protein to form a slimb protein complex. 

34. The method of Claim 33. further comprising the step of detecting said slimb 
protein complex. 
20 ' 

- 35. The method of Claim 33. wherein step b) further comprises exposing said slimb 

protein and said one or more -proteins -capable of forming a complex with said slimb protein 
to an F-box protein antagonist. 

25 36. The method of Claim 35 : wherein said F-box protein antagonist prevents the 

formation of said slimb protein complex. 
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FIGURE 1 
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FIGURE 2 
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FIGURE 2 (cont.) 
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FIGURE 2 (cont.) 
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FIGURE 3 
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FIGURE 3 (cont.) 
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FIGURE 3 (cont.) 
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FIGURE 3 (cont.) 
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FIGURE 3 (cont.) 
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FIGURE 4 
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FIGURE 4 (cont.) 
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FIGURE 5 
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FIGURE 5 (cont.) 
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FIGURE 5 (cont.) 
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FIGURE 6 
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FIGURE 6 (cont.) 
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FIGURE 7 
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FIGURE 8 
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FIGURE 9 
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FIGURE 10 
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FIGURE 11 
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FIGURE 12 
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